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Abstract. Let P:FxF— > F be a polynomial of bounded degree over a 
finite field F of large characteristic. In this paper we establish the following 
dichotomy: either P is a moderate asymmetric expander in the sense that 
\P(A,B)\ > |F| whenever A, B C F arc such that \A\\B\ > CIFp-^S for 
a sufficiently large C, or else P takes the form P(x,y) = Q(F(x) + G(y)) 
or P(x,y) = Q(F(x)G(y)) for some polynomials Q,F,G. This is a reason- 
ably satisfactory classification of polynomials of two variables that moderately 
expand (either symmetrically or asymmetrically). We obtain a similar classi- 
fication for weak expansion (in which one has \P{A,A)\ > l^l 1 / 2 ^! 1 / 2 when- 
ever \A\ > C|F| 1 ~ 1/ ' ie ), and a partially satisfactory classification for almost 
strong asymmetric expansion (in which \P(A, B)\ = (1 — 0(|F| — C ))|F| when 
|j4[, |B| > |F| 1— c for some small absolute constant c > 0). 

The main new tool used to establish these results is an algebraic regularity 
lemma that describes the structure of dense graphs generated by definable 
subsets over finite fields of large characteristic. This lemma strengthens the 
Szemeredi regularity lemma in the algebraic case, in that while the latter 
lemma decomposes a graph into a bounded number of components, most of 
which are e-regular for some small but fixed e, the latter lemma ensures that 
all of the components are 0(|F| — 1 /^-regular. This lemma, which may be of 
independent interest, relies on some basic facts about the etale fundamental 
group of an algebraic variety. 



1. Introduction 

1.1. Expanding polynomials. Let F be a finite field, let k > 1 be an integer, 
and let P : F fe — >• F be a polynomial of k variables defined over F. We will be 
interested in the regime when the order |F| of F is largeQ,but k and the degree of 
P remains bounded; one could formalise this by workings with a sequence F n of 
fields whose order is going to infinity, and a sequence P n : F^ — > F n of polynomials 
of uniformly bounded degree on each of these fields, where k is independent of n. 
But in the discussion that follows we will suppress the dependence on the sequence 
parameter n to simplify the exposition. (Later on, we will use the formalism of 
nonstandard analysis to make this suppression of n more precise.) Given k subsets 
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4n fact, the new results of this paper will be restricted to the regime in which the characteristic 
of F is large, and not just the order. 

2 This is analogous to how the concept of an expander graph does not, strictly speaking, apply 
in any non-trivial sense to a single (standard) graph, but should instead be applied to a sequence 
of such graphs, or to a single nonstandard graph. 
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j4i, . . . , A k of P, we may form the set 

P{A U ...,A k ):= {P(ai, . . . , a k ) | ffli € A u . . . , a k € A k }. 

One of the main objectives of this paper is to study the expansion properties of P, 
which informally refers to the phenomenon that for "typical" polynomials P and 
"non-trivial" A\, . . . , A k , the set P(A±, . . . , Ak) tends to be signihcantly larger than 
any of the A\,..., A k . We will focus in particular on the following five concepts in 
increasing order of strength, essentially following the notation from 36 : 

(1) We say that P is a weak expander if there are absolute constants c, C > 
such that \P(A,...,A)\ > C-^A^-^F] whenever A C F and \A\ > 
CIFI 1 ^. 

(2) We say that P is a moderate expander if there are absolute constants c, C > 
such that \P(A,...,A)\ > C' 1 ^ whenever A C F and \A\ > ClFl 1 ^. 

(3) We say that P is a almost strong expander if there are absolute constants 
c,C > such that \P(A,...,A)\ > |F| - C|F| 1 - C whenever A C F and 
\A\,...,\A\>C\F\^ C . 

(4) We say that P is a strong expander if there are absolute constants c, C > 
such that \P(A, ...,A)\ > |F| - C whenever A C F and \A\, . . . , \A\ > 
CIFI 1 -^ 

(5) We say that P is a very strong expander if there are absolute constants 
c, C > such that P(A, ...,A)=F whenever A C F and \A\, . . . , \A\ > 
CIFI 1 -^ 

As noted previously, these notions are trivial in the setting of a fixed field F 
and polynomial P, but acquire non-trivial meaning when these objects are allowed 
to depend on some parameter n. It is certainly also of interest to understand 
expansion when the sets A%, . . . , A k are small (as opposed to having cardinality at 
least C|F| 1 ~ C ), but we have nothing new to say about this case and will not discuss 
it further here, and refer the interested reader to [35] and [7] for a survey of the 
situation. 

In this paper, we will also consider the asymmetric case when the sets involved 
are distinct: 

(1) We say that P is a weak asymmetric expander if there are absolute con- 
stants c,C* > such that \P(Ai,...,A k )\ > C' 1 min(|Ai|, . . . , |A fc |) 1 ~ c |F| c 
whenever . . . , \A k \ > ClFl 1 ^. 

(2) We say that P is a moderate asymmetric expander if there are absolute con- 
stants c, C > such that \P(A 1 ,.. .,A k )\ > C~ X \F\ whenever . , \A k \ > 
CIFI 1 ^. 

(3) We say that P is a almost strong asymmetric expander if there are absolute 
constants c, C > such that \P(A 1 , . . . , A k )\ > \F\ - C|F| 1_C whenever 
\A x \,,..,\A k \>C\F\ 1 -. 

(4) We say that P is a strong asymmetric expander if there are absolute con- 
stants c, C > such that \P(Ai, . . -,A k )\ > \F\-C whenever |Ai|, . . . , \A k \ > 
CIFI 1 ^. 

(5) We say that P is a very strong asymmetric expander if there are absolute 
constants c, C > such that P(Ai, . . . , A k ) — F whenever \Ai\, . . . , \A k \ > 
C|F|^ C . 

Clearly, any of the asymmetric expansion properties implies the symmetric coun- 
terpart; for instance, moderate asymmetric expansion implies moderate expansion. 
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When k = 1, P cannot be an expander in any of the above senses, thanks to 
the trivial inequality |-P(A)| < \A\. For k > 2, there are some obvious examples of 
non-expanders. For instance, when k = 2, the polynomial P(x, y) :— x + y is not 
an expander in any of the above senses (in the limit when |F| goes to infinity), as 
can be seen by setting A\ = A 2 equal to an arithmetic progression. In a similar 
vein, P{x, y) :— xy is not an expander as one can set A\ = A 2 equal to a geometric 
progression. More generally, if P takes the additive form 

(1) P(x 1 ,x 2 )=Q(F 1 {x 1 )+F 2 (x 2 )) 
or the multiplicative form 

(2) P[x 1 ,x 2 ) = Q(F 1 (x 1 )F 2 (x 2 )) 

for some polyomials Q,Fi,F 2 : F — > F of bounded degree, then P will not be an 
expander in any of the asymmetric senses, as can be seen by taking A; = F^ 1 (Ei) 
for i = 1,2, where E\,E 2 are randomly chosen arithmetic (resp. geometric) 
progressions of fixed length L of equal spacing (resp. ratio). By setting in- 
stead A := F^ 1 {E 1 ) n F 2 ~ 1 (E 2 ), we see from the first moment method that we 
can find length L progressions E\,E 2 of equal spacing with \A\ > L 2 /|F| (resp. 
\A\ > L 2 /(|F| — 1)) in the additive (resp. multiplicative) case; taking L close to 
|F|, we conclude that such polynomials cannot be moderate expanders (although 
this argument is not strong enough rule out weak expansion, unless F\ — F 2 ). A 
construction in 32 s also shows that no polynomial of two variables can be a strong 
expander. 

On the other hand, by using estimates related to the sum-product phenome- 
non, there are several results in the literature establishing various sorts of expan- 
sion for certain classes of polynomials. We will only give a sample of the known 
results here (focusing exclusively on the regime of large subsets of F), and re- 
fer the reader to [36] and [7] for a more comphensive survey of results. Soly- 
mosi [58] used graph-theoretic methods to establish weak expansion for polyno- 
mials of the form P(x\,x 2 ) = f{x\) + x 2 when / was a nonlinear polynomial of 
bounded degree; his results also show weak asymmetric expansion for polynomials 
of the form P(x\, x 2 , X3) = f{xi) + x 2 + X3. These results were generalised in 
[36] (by a Fourier-analytic method), establishing for instance weak expansion for 
P(xi,x 2 ) — f{x\) + g(x 2 ) for non-constant polynomials /, g whose degrees are dis- 
tinct and less than the characteristic of F, with a similar result for polynomials of 
the form P{x\,x 2 ) — f{x\)g{x 2 ). In [57], Shkredov established very strong expan- 
sion for the polynomial P(x%, x 2 , x%) — x\ +x\x 2 + X3, and moderate expansion for 
P{x\, x 2 ) — x\ [x\ +x 2 ), while in [22] weak expansion for P{x\,x 2 ) := x\ (x 2 + 1) in 
fields of prime order was established. As a consequence of their results on the finite 
field distinct distances problem, Iosevich and Rudnev [39] established the strong 
expansion of polynomials of the form P(x\ , . . . } xj, yi, ■ ■ ■ , Vd) '■= Si=i i x i ~ Vi) 2 
for any d > 2, and in a similar spirit Vu |66] established^) the moderate expansion 
of polyomials of the form P(x\, x 2 , yi,y 2 ) = f{x\ — yi, x 2 — y 2 ) for any symmetric 
polynomial / of bounded degree which was non- degenerate in the sense that / is not 
of the form f(xi,x 2 ) — Q{ax\ + bx 2 ) for some polynomial Q and constants a, b. In 
[55], moderate expansion for polynomials of the form P(xi,x 2 ) = f(x\) + x\g(x 2 ) 



^In fact, the result in |66l yields a stronger lower bound on expressions such as \A + A\ + 
\}{A,A)\. 
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was established when f(xx) is affincly independent of x\, improving upon earlier 
work of Bourgain [3j. In [7], it was shown that any polynomial P{xi 1 x 2 ) that is 
not of the form Fi(x\) + F 2 {x 2 ) or Fi(xi)F 2 (x 2 ), is monic in each of the two vari- 
ables Xx,X2, and is non-composite in that it is not of the form P = Q o R for some 
polynomials functions Q : F — > F, R : F — > F over the algebraic completion of F 
with Q non-linear, then P is a weak asymmetric expander. 

We can now present our first set of new results regarding expansion, in the 
context of polynomials of two variables in a field of large characteristic. We first 
give the formulation that pertains to moderate asymmetric expansion. 

Theorem 1 (Moderate asymmetric expansion). For any degree d, there exists 
a constant C such that the following statement holds. Let F be a finite field of 
characteristic at least C, and let P : F x F — > F be a polynomial of degree at most 
d. Then at least one of the following statements hold: 

(i) (Additive structure) One has 

(3) P(x u x 2 ) = Q{F 1 (x 1 ) + F 2 {x 2 )) 

(as a polynomial identity in the indeterminates Xi,x 2 ) for some polynomials 
Q,F U F 2 :F^F. 

(ii) (Multiplicative structure) One has 

(4) P(x u x 2 ) = Q{F 1 {x 1 )F 2 {x 2 )) 

for some polynomials Q, Fx, Fi : F — > F. 

(iii) (Moderate asymmetric expansion) One has 

\P(A 1: A 2 )\ > C-^FI 
whenever A x , A 2 are subsets ofF with |j4i||A 2 | > C|F| 2 ~ 1/8 . 

The degree of Q, Fx, F 2 is not specified in (i), (ii), but it is easy to see that one can 
restrict to the case when Q, F\,F 2 have degree at most d with no loss of generality, 
since (except in degenerate cases when one of Q, Fi, F 2 is constant) it is not possible 
to have either (i) or (ii) hold if Q, Fx, or F 2 has degree greater than d. The exponent 
1/8 appearing in the above theorem is an artefact arising from the number of times 
we were forced to apply the Cauchy-Schwarz inequality in our arguments, and we 
do not believe it to be optimal. When option (iii) occurs, we are also able to obtain 
additional bounds on the set {(01,02,03) e Ai x A 2 x A 3 : P(ai,a 2 ) — 03} for 
various sets Ax,A 2 ,A 3 ; see Remark below. 

Thus, we see that in the large characteristic case, the only polynomials in two 
variables that are not moderate asymmetric expanders are the polynomials given 
by the examples ([T]), ([2]), which as discussed previously are not weak asymptotic 
expanders or moderate expanders. In particular, this shows that there is no dis- 
tinction between moderate asymmetric expansion, moderate expansion, and weak 
asymmetric expansion, at least for polynomials of two variables in the large charac- 
teristic case. This result partially addresses a conjecture of Bukh and Tsimerman 
[7[ §9] and of Vu [5H1 Problem 4] , at least in the case of large characteristic and 
reasonably dense sets A, and it seems likely that the methods here could be used 
to make further progress on these conjectures. We remark that some analogous 
results, in which the finite field F was replaced by the real field E, the complex 
field C, or the rationals Q were obtained in [T7], [IE], and [53] respectively, using 
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very different methods (related to the Szcmcrcdi- Trotter theorem) from those used 
here. 

Theorem 1581 will be established at the end of Section [7] By combining the above 
results with the Fourier-analytic arguments in |36j , we can obtain a similar criterion 
for weak expansion: 

Theorem 2 (Weak expansion). For any degree d, there exists a constant C such 
that the following statement holds. Let F be a finite field of characteristic at least 
C , and let P : F x F — > F be a polynomial of degree at most d. Then at least one 
of the following statements hold: 

(i) (Additive structure) One has 

P{x x ,x 2 ) = Q{aF{ Xl )+bF{x 2 )) 

for some polynomials Q, F : F — >■ F, and some elements a, b S F. 

(ii) (Multiplicative structure) One has 

P(x 1 ,x 2 )=Q(F(x 1 ) a F(x 2 ) b ) 

for some polynomials Q,F : F — > F, and some natural numbers a,b with 
a,b<C. 
(hi) (Weak expansion) One has 

\P(A,A)\ > C^IFI 1 / 2 !^ 1 / 2 

whenever AcF with \A\ > CIF^ 1 / 16 . 

Again, this is a reasonably good classification of the polynomials which weakly 
expand, except that in the case (i) of additive structure, some further information 
on the ratio b/a of the two elements should be obtained (this ratio should be "low 
complexity" in some sense). We will not pursue this issue here; it boils down to 
the expansion properties of A + a A for various values of a, an issue studied in |42) . 
[6], [IT] in the case when A lies in the integers Z rather than in F (see also [50] 
for some initial results in the finite field setting) . Theorem [2] will be established in 
Section [8j 

We now turn to the analogue of the above results for almost strong asymmetric 
expansion, where our results are unfortunately somewhat less satisfactory. The 
situation here is necessarily more complicated, as can be seen by the following 
simple observation: if a polynomial P is an almost strong asymmetric expander, 
then its square P 2 is automatical!}!! a moderate asymmetric expander, but not 
an almost strong asymmetric expander, because P 2 is clearly restricted to the 
quadratic residues. More generally, if P obeys a polynomial identity of the form 

(5) P(f(x 1 ),g(x 2 )) = h(Q(x 1 ,x 2 )) 

for some polynomials f,g,h:F—*F and Q : F x F — > F with h nonlinear and 
/, g non-constant, then it is likely that P will not be an almost strong asymmetric 



4 This observation technically answers the question posed at the end of 1361 §1] as to whether 
there are moderate expanders which are not strong expanders; for instance one can take the square 
(x 2 -\-x\X2 + X3) 2 of the strong expander of Shkredov |57| . However, this is something of a "cheat" , 
and the interesting question remains of whether there exists a non-composite polynomial P which is 
a moderate expander but not a strong expander. In view of Theorem[3]below, this basically reduces 
(in the large characteristic case, at least) to the task of locating a non-composite polynomial that 
obeys an algebraic constraint JTJl without having additive or multiplicative structure. 
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expander, because P(f(F),g(F)) C h(F) and h(F) is not, in general, equajlto all of 
F and is instead usually just a dense subset of F. An example of such a polynomial 
identitjQ is 

(6) P{x^x%) = P{x u x 2 ) n 

when P is a monomial of the form P(xt, x 2 ) = x\ x\ and a,b,n are arbitrary natural 
numbers, which shows that P maps n th powers to n th powers. 

Our next main result is an attempt to assert that this is the only additional 
obstruction to almost strong expansion, beyond the obstructions already identified 
in Theorem 1381 Unfortunately, due to limitations in our arguments, we will be 
forced to generalise the above example, in which the polynomials f,g,h,Q are 
defined over the algebraic closure F of F, and for which the domains of /, g, Q are 
constructed using affine algebraic curves (possibly of positive genus) instead of the 
affine line. More precisely, we have: 

Theorem 3 (Almost strong asymmetric expansion) . For any degree d, there exists 
a constant C such that the following statement holds. Let F be a finite field of 
characteristic at least C , and let P : F x F — > F be a polynomial of degree at most 
d. Then at least one of the following statements hold: 

(i) (Additive structure) One has 

P(xt,x 2 ) = Q(Ft(xt) + F 2 (x 2 )) 

for some polynomials Q, Ft,F 2 . 

(ii) (Multiplicative structure) One has 

P(x 1 ,x 2 )=Q(F 1 {x 1 )F 2 {x 2 )) 

for some polynomials Q, Ft, F 2 . 
(hi) (Algebraic constraint) There exist irreducible affine curves V C F ,W C 
F of complexity at most C and definable over an extension of F of degree 
at most C , as well as polynomial maps f : F — > F, g : F — > F, Q : 
F xF F, h : F — > F of degree at most C , and whose coefficients lie 
in an extension of F of degree at most C , such that the restrictions of /, g 
to V, W respectively are non- constant, and h has degree at least two, and 
one has the identity 

(7) P(f(x 1 ),g(x 2 )) = h(Q(x 1 ,x 2 )) 

for all X\ G V and x 2 G W. 
(iv) (Almost strong asymmetric expansion) One has 

\F\P(At,A 2 )\<C\F\(\^^ ] 



JF|' 

whenever A\,A 2 are non-empty subsets ofF. 



^One can however construct some examples of nonlinear polynomials h that are bijective on a 
certain finite field, for instance x 1— > x 3 is bijective on a field F p of prime order p whenever p — 1 
is coprime to 3. 

^Admittedly, this is also an example of a polynomial P that obeys the multiplicative structure 
Q. We were unable to either exhibit or rule out an example of a polynomial P obeying ifSJ but 
not J3J, but it seems of interest to resolve this question. 

^We will review algebraic geometry notation such as "irreducible" , "affine" , "curve" , and 
"complexity" in the next section. 
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We establish this result in Section HI While this theorem in principle gives 
a purely algebraic description of the polynomials P that are not almost strong 
asymmetric expanders, it is not fully satisfactory, due to the excessively complicated 
form of (iii). It would be of interest to simplify this constraint^, in order to make 
this theorem more useful in applications (and in particular, to allow one to exhibit 
explicit examples of strong expander polymomials in two variables). 

It is likely that one can iterate the above results to obtain some classification of 
various types of expanding polynomials in three or more variables, but we will not 
pursue this question here. 



1.2. An algebraic regularity lemma. The main new tool that we introduce to 
establish the above results is an algebraic regularity lemma which improves upon 
the Szemeredi regularity lemma [611 in the case of dense graphs that are definable 
in the language of fields over a field of large characteristic; this lemma seems to be 
of independent interest. To describe this new lemma, let us first give a formulation 
of the usual regularity lemma: 

Lemma 4 (Szemeredi regularity lemma). [61] lfe>0, then there exists C = C £ > 
such that the following statements hold: whenever V, W are non-empty finite sets 
and E C V xW, then there exists partitions V = Vi U . . . U V a , W = W\ U . . . U Wb 
into non-empty sets, and a set I C {1, . . . , a} x {1, . . . , b} of exceptional pairs, with 
the following properties: 

(i) (Low complexity) a,b <C. 

(ii) (Few exceptions) E(ij) 6 j Nl^'l < e\V\\W\. 

(iii) (e -regularity) For all (i, j) G {1, . . . , a} x {1, . . . , b}\I , and all A C Vi, B C 



Wj , one has 



\Ef](Ax B)\ - dij\A\\B\\ < eWWW^ 



where d %3 := lE ^^ j)l . 

The dependence of C e on s is notoriously poor (tower exponential in nature); 
see [H]. 

Now we restrict attention to definable sets. If F is a field, and n > is a natural 
number, a definable subset of F™ is defined to be any set of the form 

(8) {(xi, ...,£„) € F n | p(xi, . . . , x n ) is true} 

where p() is any formula involving n variables x\, . . . ,x n and a finite number of 
additional constants c\ , . . . , c m € F and bound variables y\ , . . . , yi , as well as the 



8 For instance, it is conceivable that one could eliminate the cases when the curves V, W have 
positive genus, so that they could be replaced by the affine line F, thus reducing the constraint 
10 back to the simpler constraint (0. Also, it might be possible (perhaps by utilising some Galois 
theory) to reduce to the case where the curves V, W and maps f,g,Q,h are defined over F rather 
than over F. Finally, it may be possible to get some more effective bounds on the degrees of 
V, W, /, g, Q, h in terms of the degree of P, although the example (0 indicates that one may have 
to make some "minimality" assumptions on these objects before an effective degree bound can be 
obtained. We were unable to achieve any of these goals, but believe that they are all worthwhile 
to pursue. Some variant of Ritt's theory of decomposition into prime polynomials (see e.g. [69]) 
may be relevant for this purpose. 
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ring operation^ +, X, parentheses (, ), the equality sign =, the logical connectives 
-i, A, V, => , and the quantifiers V, 3 (where the quantification is understood to be 
over the field F). Thus, for instance, the F-points 

V(F) = {(li In)eP I P 1 {x l ,...,X n ) = ...=P m (x 1 ,...,X n ) = 0} 

of an algebraic variety V defined over F, where Pi, ... , P m : F™ — > F are polynomi- 
als with coefficients in F, form a definable set. A bit more generally, the F-points 
of any constructible set in F" (i.e. a boolean combination of a finite number of 
algebraic varieties in F") is a definable set. As is well known, in the case when F 
is algebraically closed, the constructible sets are the only definable sets, thanks to 
the presence of quantifier elimination (or Hilbert's nullstellensatz) in this setting; 
but for non-algebraically closed fields, other definable sets also exist. For instance, 
the set 

(9) Q := {x £ F | 3y € F : x = y 2 } 

of quadratic residues in F is definable, but is usually not constructible. 

Now we specialise to the case where F is a finite field. Strictly speaking, the 
theory of definable sets on such fields is trivial, since every subset of F" is finite and 
thus automatically definable. However, one can recover a more interesting theory 
by limiting the complexity of the definable sets being considered. Let us say that a 
subset E of F" is a definable set of complexity at most M if the ambient dimension 
n is at most M , and E can be expressed in the form ([8]) for some formula <f> of 
length at most M, where we consider all variables, constants, operations, parenthe- 
ses, equality symbols, logical operations, and quantifiers to have unit length. One 
is then interested in the regime where M stays bounded, but the cardinality or 
characteristic of F goes to infinity. 

We can now give the algebraic regularity lemma. 

Lemma 5 (Algebraic regularity lemma). If M > 0, then there exists C = Cm > 
such that the following statements hold: whenever F is a finite field of characteristic 
at least C , V, W are non-empty definable sets over F of complexity at most M , and 
E C V x W is another definable set over F of complexity at most M , then there 
exists partitions V = Vi U. . .U V a , W = W\ U . . . U Wb, with the following properties: 

(i) (Largeness) For all i 6 {1, . . . , a} and j <E {1, . . . , b}, one has \Vi\ > | Vj/C 
and \Wj \ > \ W\/C. In particular, a,b< C. 

(ii) (Bounded complexity) The sets V\, . . . ,V a , W\, . . . , Wb are definable over F 
with complexity at most C . 

(iii) (1F| _1 ^-regularity) For all (i,j) G {1, . . . , a} x {1, . . . , b}, and all A C 
Vi , B C Wj , one has 

(10) \\En(Ax Bjl-d^AWBW <C\F\- 1 l i \V l \\W ] \ 
where dij := - v^llWj ' 

Comparing this lemma with Lemma IH we see that one has substantially more 
regularity (a power gain in |F|), and no exceptional pairs (i,j), thanks to the 
bounded complexity of the set E. Furthermore, the cells V%, ... ,V a , W%, . . . , Wb 



®One could also, if one wished, also include the inversion operation () — 1 (after handling some- 
how the fact that — 1 is undefined), but of course any formula involving this operation can be 
replaced with an equivalent (albeit slightly longer) formula involving the operations +, — , X and 
some additional variables and existential quantifiers. 
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are not arbitrary, but are themselves definable with bounded complexity. Let us 
illustrate this lemma with two simple examples: 

Example 6. Let V = W = F\{0}, and let E be the set of all pairs (v,w) such 
that vw is a quadratic residue; this is clearly a definable set of bounded complexity 
Then we can regularise E by partitioning V = W into the quadratic residues and 
the non-quadratic residues, with the set E having density either zero or one in each 
of the four pairs Vi x Wj created by this partition. 

Example 7 (Paley graph). Let V = W = F, and let E be the set of all pairs (v, w) 
such that v — w is a quadratic residue. Standard Gauss sum estimates then show 
that 

l^n^xB^^HBi+oGFl 2 - 1 / 2 ) 

for any 4,B C F, thus giving (fT0|) with the exponent 1/4 improved to 1/2, and 
with a = b = 1. (It may be that this improvement of 1/4 to 1/2 is in fact true 
in all cases; our method is unable to show this due to the fact that it invokes the 
Cauchy-Schwarz inequality at one point, which halves the exponent gain that one 
expects in bounds such as ([TO]).') 

As a very rough first approximation, one can interpret Lemma [5] as an assertion 
that any definable subset of V X W behaves like some combination of the two basic 
examples listed above. 

Remark 8. A result in a somewhat similar spirit to Lemma [5] was established by 
Kowalski [43] . In our notation, the main result is as follows: if E is a definable subset 
of complexity at most M over a finite field F of prime order p, /, g : F — > F are 
non-constant polynomials of degree at most M, and \ '■ F — s- C is any multiplicative 
character, then 

\J2x(9(x))e 2 * if(x)/p \<C M VP 

for some quantity Cm depending only on M. This allows for a substantial generali- 
sation of Example [7] to definable Cayley graphs, and also allows for some "twisting" 
of such graphs by multiplicative characters. It is concievable that one could simi- 
larly twist the algebraic regularity lemma by allowing some "twisted" generalisation 
of the notion of definable set, but we will not pursue this issue here. 

Remark 9. A recent paper of Malliaris and Shelah [17] links the absence of ex- 
ceptional pairs in the regularity lemma with the concept of stability from model 
theory (roughly speaking, the inability to definably create large induced copies of 
the half-graph). The algebraic regularity lemma can thus be viewed as asserting a 
(somewhat exotic) form of model-theoretic stability for the language of finite fields 
of large characteristic. 

Actually, in applications we will use an iterated form of the regularity lemma in 
which one works not with a dense definable subset E C V x W of the product of 
two definable sets, but a subset E C V% x . . . x Vfe of a bounded number of definable 
sets (for our application to expansion, we will take k = 4); see Theorem [35] This 
lemma is to the k = 2 case as the Chung hypergraph regularity lemma [10) (see also 
[20) ) is to the Szemeredi regularity lemma. It seems of interest to obtain stronger 
hypergraph regularity lemmas, analogous to those in [26], [52], [51], [63], but we 
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will not pursue this matter here. (See [45] for a discussion of the general poset of 
hypergraph regularity lemmas.) 

The regularity lemma is not directly applicable to the expansion problem, basi- 
cally because the graph {(xi, x 2 , P(xx, x 2 )) | x±,x 2 € F} of the polynomial is too 
sparse a subset of F 3 for this lemma to be useful. However, as observed in [6], if 
one applies the Cauchy-Schwarz inequality a few times (in the spirit of [25,), one 
can effectively replace the above graph by the set 

{(P(x 1 ,x 2 ),P(x 1 ,y 2 ),P(y 1 ,x 2 ),P(y 1 ,y 2 )) \ x 1 ,x 2 ,y 1 ,y 2 e F}, 

which for "generic" P will be a dense subset of F 4 to which the (iterated) algebraic 
regularity lemma can be applied. (It is these applications of Cauchy-Schwarz that 
reduce the exponents in our final expansion results to be 1/8 or 1/16 instead of 
1/4.) There will be some exceptional cases in which this set fails to be Zariski 
dense, but we will be able to show (using a Riemann surface^ argument!) that 
those cases only arise when one has additive or multiplicative structure, giving rise 
to the trichotomies and tetrachotomies in Theorems [T] [2] [3] 

Our proof of the regularity lemma will use two somewhat exotic ingredients. 
The first is the use of nonstandard analysis, in order to convert a quantitative 
problem involving finite fields of large characteristic into an equivalent qualitative 
problem involving pseudo-finite fields of zero characteristic. The main reason for 
using the nonstandard formalism is so that we may deploy the second ingredient, 
which is the theory of the etale fundamental group of algebraic varieties over fields 
of characteristic zer The reason that the etale fundamental group comes into 
play is because it plays a key role in counting the number of connected components 
of certain algebraic varieties that will arise in the argument, and this in turn is 
needed to count the number of F-points on those varieties thanks to the Lang- Weil 
estimates [H]. Some special cases of this general theme are already visible in the 
work of Bukh and Tsimerman [7] on polynomial expansion; the etale fundamental 
group is not explicitly mentioned in their paper, but is implicitly present in some 
of the algebraic geometry lemmas used in that paper (e.g. [7] Lemma 21]). 

As a byproduct of our reliance on nonstandard methods, we do not obtain any 
quantitative bounds in our main theorems; in particular, we cannot explicitly give 
values for the constants C in those theorems. In principle such bounds could even- 
tually be extracted from suitable finitisations of the arguments here, but this would 
require (among other things) effective versions of results on the etale fundamental 
group (in the case of sufficiently large characteristic, rather than in zero character- 
istic), which seems feasible but only after an enormous amount of effort, which we 
will not expend here. 



^This argument is closely related to the classical fact that the only one-dimensional alge- 
braic groups over the complex numbers are (up to isomorphism) the additive group (C, +), the 
multiplicative group (<C X ,-), and the elliptic curves. The elliptic curve case will eventually be 
eliminated because the underlying map P is polynomial rather than merely rational. 

11 One can also define the etale fundamental group in positive characteristic, but the theory is 
significantly less favorable for our purposes; in particular, one cannot guarantee that this group is 
topologically finitely generated. This is one of the main reasons why our results are limited to the 
large characteristic setting. We will also take advantage of characteristic zero to use the theory 
of Riemann surfaces in order to analyse algebraic curves, although this is largely for reasons of 
convenience, as many of the Riemann surface facts we will use have purely algebraic counterparts 
that are also valid in positive characteristic. 
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Remark 10. Throughout this paper we shall freely use the axiom of choice. How- 
ever, thanks to a well known result of Godel [23], any result that can be formalized 
in first-order arithmetic (and the main results of this paper are of this type) and 
is provable in Zermelo-Frankel set theory with the axiom of choice (ZFC), can also 
be proven in Zermelo-Frankel set theory without the axiom of choice (ZF). 

1.3. Acknowledgments. The author is greatly indebted to Brian Conrad and 
Jordan Ellenberg for their many patient explanations of the etale fundamental 
group, to Jozsef Solymosi for useful suggestions, and to Antoinc Chambert-Loir, 
Jordan Ellenberg, Norbert Hegyvari, and Van Vu for corrections and comments. 
The author was also partially supported by a Simons Investigator award from the 
Simons Foundation and by NSF grant DMS-0649473. 

2. Algebraic geometry notation 

In this section we lay out the basic algebraic geometry notation that we will need 
throughout this paper. 

Definition 11 (Algebraic varieties). Let n be a natural number, and let k be an 
algebraically closed field. An affine variety (or more precisely, an algebraic set) in 
k n over A: is a set V of the form 

(11) V={xek n \P 1 (x) = ... = P m (x) = 0} 

for some polynomials Pi, ... , P m : k n — > k. Similarly, we define the projective space 
P ra (fc) to be the space of equivalence classes [xi, . . . , x n +i] of tuples (x\, . . . , x n +i) <= 
k n+1 \{0} after quotienting out by dilations by k, and define a projective variety in 
P n (k) to be a set V of the form 

(12) V = {x e P n (fc) | Pi Or) = ... = P m (x) = 0} 

for some homogeneous polynomials Pi, ... , P m : k n+1 —> k (note that the constraint 
Pi (x) = ... = P m (x) = is well-defined in P"(fc)). We embed k n in P"(fc) in the 
usual manner, identifying (x\, . . . , x n ) with [xi,..., x n , 1]; thus for instance every 
affine variety can be viewed as a subset of an associated projective variety. 

A subset of P™(fc) is said to be a quasiprojective variety if it is the set-theoretic 
difference of two projective varieties in P n (fc). Thus for instance the set-theoretic 
difference of two affine varieties in k n is a quasiprojective variety. A constructible 
set in k n is a boolean combination of finitely many affine varieties in k n . As noted 
in Section [TT21 constructible sets are definable over k (indeed, as k is algebraically 
closed, the two concepts coincide in this setting), and so we can inherit the notion 
of complexity for such sets. 

We define the Zariski topology on V n (k) by declaring the projective varieties to 
be the closed sets; thus, for instance, the Zariski closure E of a subset E of P™(fc) 
is the intersection of all the projective varieties which contain that set. One can 
then induce the Zariski topologies on other varieties by restriction. For instance, 
in the affine space k n , the Zariski closed sets are given by the affine varieties. 

An affine (resp. projective) variety is geometrically irreducible, or irreducible for 
short, if it is non-empty and cannot be expressed as the union of two strictly smaller 
affine (resp. projective) varieties. We say that a quasiprojective variety (or more 
generally, a constructible set) is irreducible if its Zariski closure is irreducible. It is 
well known (see e.g. [3H1 Propositions 1.5.2, 1.5.3]) that any affine (resp. projective) 
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variety can be uniquely decomposed into finitely many irreducible subvarieties, no 
two of which are contained in each other. 

The dimension dim(V) of a non-empty affine (resp. projective) variety V is the 
largest natural number d for which there is a chain 

V C Vi C . . . C V d C V 

of irreducible affine (resp. projective) varieties Vb, • • • , V4; this is always a finite 
natural number. We adopt the convention that the empty set has dimension —00. 
The dimension of a quasiprojective variety (in k n or P™(fc)) is defined to be the 
dimension of its Zariski closure in the indicated ambient space. A variety will be 
called a curve if it has dimension one. 

If V, W are two varieties with V C W, we say that V is a subvariety of W. If 

V C W, we say that V is a strict subvariety of W. 

Let F be a subfield of k. We say that an affine (resp. projective) variety is 
defined over F if one can find polynomials Pi, ... , P m with coefficients in F for 
which (fTTj) (resp. (|T2"j) ) holds. A quasiprojective variety is defined over F if it is 
the set-theoretic difference of two affine varieties defined over F. If V C k n is a 
quasiprojective variety, we define V(F) := V n F n to be the -F-points of V; note 
that this is a definable subset over F. 

Let V be a quasiprojective variety. If V C k n , a regular function on V is a 
function / : V — > fc which, at every point p in 1/, agrees with a rational function from 
(an open dense subset of) k n to k on an open neighbourhood of p in V. If instead 
y (- pn^^ a regular function on is a function which is regular when restricted 
to the n + 1 affine subsets {[#1, . . . ,x n+ i] e F' l (fc) | cCj 7^ 0}, i = 1, . ..,n + 1 
that cover F n (k), each of which can be identified with the affine space k n in the 
obvious manner. The ring of regular functions on V is denoted k[V], and its fraction 
field (which is well defined for irreducible V) is denoted k(V). A map <f> : V — > W 
between two quasiprojective varieties is a regular morphism if every regular function 
on W pulls back by to a regular function on V. A regular isomorphism is an 
invertible regular morphism whose inverse is also regular. 

In most of this paper, we will only need to work with constructible subsets of 
affine space k n , such as affine varieties. However in Section Owe will also need 
to work with projective varieties (in order to use the theory of compact Riemann 
surfaces) . 

If V is an affine variety, then k[V] is just the restriction of the polynomials on 
k n to V; see [35l Theorem 3.2]. If V is an irreducible projective variety, then k[V] 
consists only of the constant functions; see [35J Theorem 3.4]. 

We recall some basic facts about dimension. Firstly, we have dim(fc") = n for 
any n, and dim(y x W) = dim(F) + dim(VK) for any constructible sets V, W; see 
e.g. [49l §1.7]. We clearly also have dim(V) < dim(M / ) whenever V C W, with 
strict inequality when W is an irreducible affine (resp. projective) variety and V is 
a strict affine (resp. projective) subvariety. By construction, all affine or projective 
varieties of finite non-zero cardinality have dimension zero, and all such varieties of 
infinite cardinality have dimension greater than zero, and so the same is also true 
for constructible sets. We also have the following basic fact: 

Proposition 12 (Projections). Let k be an algebraically closed field, and let tt : 

V — > W be a regular map between two quasiprojective varieties V, W with V irre- 
ducible. Then ir(V) is an irreducible constructible set, and there is a subvariety E 
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ofit{V) of dimension at most dim(7r(V)) — 1 such that n ({x}) DV has dimension 
dim(V) — dim(ir(V)) for all x £ n(V)\E. Furthermore, the set 7r (S) fl V has 
dimension at most dim(V) — 1. 

Proof. See [U Lemma A. 8]. □ 

This has the following consequence. If V is an irreducible constructible set, we 
say that a property P(x) of points in V holds for generic x € V if there is a 
subvariety £ of V of dimension at most dim(V) — 1 such that P(x) holds for all 

x e 

Lemma 13 (Generic Fubini-type theorem). Let V,W be constructible sets, and let 
E be a constructible subset ofVxW. Then the following are equivalent: 

(i) For generic v £ V , one has (v, w) £ E for generic w £ W . 

(ii) For generic w £ W , one has {v, w) £ E for generic v £ V . 
(hi) For generic (v,w) el^x W , one has (v,w) £ E. 

Proof. By symmetry, it suffices to show the equivalence of (i) and (hi). Write 
S := (V^x W)\E. If (hi) holds, then £ has dimension at most dim(V) + dim(W / ) — 1. 
Writing it : £ — >• V for the projection map and using Proposition [T21 we see that 
the generic fibre of ir has dimension at most dim(VF) — 1, giving (i). 

Conversely, if (hi) fails, then £ has dimension dim(V) + dim(iy), and so contains 
at least one of the components Vi x Wj where Vi is a top-dimensional irreducible 
component of V , and similarly for Wj . The complement of S in Vi x Wj then has 
dimension at most dim(V^) + dim(W / ) — 1, and so by the previous arguments, we 
see that for generic v £ Vi, one has w £ £ for generic w £ Wj, which contradicts 
(i) as required. □ 

3. Nonstandard formulation 

As discussed in the introduction, it will be convenient to pass to a nonstandard 
analysis formalism in order to take full advantage of the existing literature in al- 
gebraic geometry and on the etale fundamental group, as well as to be able to use 
some tools from the theory of Riemann surfaces which are only available for vari- 
eties when the characteristic is zero. In this section, we set up this formalism, and 
give the nonstandard version of the main theorems. 

We will assume the existence of a standard universe il which contains all the 
objects and spaces that one is interested in (such as the natural numbers N, the 
real numbers K, finite fields F, constructible sets or varieties defined over F or F, 
maps between such spaces, etc.). The precise construction of this universe is not 
particularly important for our purposes, so long as it forms a set in our external 
set theory. We refer to objects and spaces inside the standard universe as standard 
objects and standard spaces, with the latter being sets whose elements are in the 
former category. Thus, for instance, we refer to elements of N as standard natural 
numbers. 

For the rest of the paper, we fix a non-principal ultrahlter a £ /?N\N on the 
natural numbers, that is to say a collection of subsets of N obeying the following 
axioms: 

(1) If E, F G a, then E D F £ a. 

(2) If E C F C N and E £ a, then F £ a. 

(3) If E C N, then exactly one of E and N\E lies in a. 
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(4) No finite subset of N lies in a. 
The existence of such a non-principal ultrafilter follows easily from Zorn's lemma. 

Throughout the paper, we fix a non-principal ultrafilter a. A property P (n) 
depending on a natural number n is said to hold for n sufficiently close to a if the 
set of n for which P(n) holds lies in a. A set of natural numbers lying in a will 
also be called an a-large set. 

Once we have fixed this ultrafilter, we can define nonstandard objects and spaces. 

Definition 14 (Nonstandard objects and functions). Given a sequence (x n ) n£ N 
of standard objects in it, we define their ultralimit lim n ^ Q x n to be the equivalence 
class of all sequences (y n )neN of standard objects in it such that x n = y n for n 
sufficiently close to a. Note that the ultralimit lim n ^ a x n can also be defined even 
if x n is only defined for n sufficiently close to a. 

An ultralimit of standard natural numbers is known as a nonstandard natural 
number, an ultralimit of standard real numbers is known as a nonstandard real 
number, and so on. 

For any standard object x, we identify x with its own ultralimit lim n _s. a x. Thus, 
every standard natural number is a nonstandard natural number, etc. 

Any operation or relation on standard objects can be extended to nonstandard 
objects in the obvious manner. Indeed, if O is a fc-ary operation, we define 

0{ lim xi, . . . , lim a£) := lim 0(a£, . . . , a£) 

n— ¥a n— ¥a n^a 

and if R is a fc-ary relation, we define i?(lim n _;. Q x n , . . . , lim n _s. a x n ) to be true iff 
R(x n , . . . , x^) is true for all n sufficiently close to a. One easily verifies that these 
nonstandard extensions of O and R are well-defined. 

Example 15. The sum of two nonstandard real numbers lim n ^ Q x n , lim n ^ a y n is 
the nonstandard real number 

lim x n + lim y n = lim x n + y n , 

n— >a n— >a n— >ct 

and the statement lining x n < lim n ^ a y n means that x n < y n for all n sufficiently 
close to a. 

We will use the usual asymptotic notation from nonstandard analysis: 

Definition 16 (Asymptotic notation). A nonstandard real number x € *R is said 
to be bounded if one has \x\ < C for some standard C > 0, and unbounded otherwise. 
Similarly, we say that x is infinitesimal if |a;| < c for all standard c > 0; in the former 
case we write x = 0(1), and in the latter x = o(l). For every bounded real number 
x£*M there is a unique standard real number st(x) e K, called the standard part 
of M, such that x = st(x) + o(l), or equivalently that st(x) — e < x < st(x) + e for 
all standard e > 0. Indeed, one can set st(x) to be the supremum of all the real 
numbers y such that x > y (or equivalently, the infimum of all the real numbers y 
such that x < y). 

We write X = 0(Y), X < Y, or Y > X if we have X < CY for some standard 
C; and we write X = o(Y), X <IC Y, or Y ^> X if we have X < eY for every 
standard e > 0. 

Definition 17 (Ultraproducts) . Let (A n ) n£ N be a sequence of standard spaces 
A n in it indexed by the natural numbers. The ultraproduct Yln^a -^n of the A n is 
defined to be the space of all ultralimits linin^a x n , where x n e X n for all n. We 
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refer to the ultraproduct of standard sets as an nonstandard set; in a similar vein, an 
ultraproduct of standard finite sets is a nonstandard finite set, and an ultraproduct 
of standard finite fields is a nonstandard finite field. We refer to * X :— Jln-s-a ^ 
as the ultrapower of a standard set X; the identification of x with linin^Q, x causes 
X to be identified with a subset of *X. We will refer to the ultrapower *il of the 
standard universe il as the nonstandard universe. 

In a similar spirit, if / n : A n — > Y n is a collection of standard functions between 
standard sets X n , Y" n , we can form the ultralimit f := limn-^ / n to be the function 
from X := Yl n -^a to Y := Iln->a defined by the formula 



We refer to such functions as nonstandard functions (also known as internal func- 
tions in the nonstandard analysis literature). 

As with nonstandard objects, any operation or relation on standard spaces can 
be converted to a nonstandard analogue in the usual manner. For instance, the non- 
standard cardinality of a nonstandard finite set X = Y\ n -ta -^n ' s the nonstandard 
natural number 



Note that this is a different concept from the usual (or external) cardinality of 
X; indeed, nonstandard finite sets usually have an uncountable external cardinal- 
ity. Similarly, if / : X — > *R is a nonstandard function / = lim n _j. Q / n defined 
on a nonstandard finite set X = T[ n ^ a X n , we can define the nonstandard sum 
S x" f(x) to be the nonstandard real number 



A fundamental property of ultralimits is that they preserve first-order statements 
and predicates, a fact known as Los's theorem: 

Theorem 18 (Los's theorem with parameters and ultraproducts) . Let m,k be 
standard natural numbers. For each 1 < i < m, let Xi = lim n _j. Q Xi n be a nonstan- 
dard object, and for each 1 < j < k, let Aj = Yln—y a be a nonstandard set. 
If P(yi, ■ ■ ■ , y m ; B\, . . . , Bk) is a predicate over m objects and k sets, with the sets 
A\ , . . . , Ak only appearing in P through the membership predicate x G Bj for vari- 
ous j and various objects Bj, then P{x\, . . . , x m ; A\, . . . , Ak) is true (as quantified 
over the nonstandard universe if and only if P(xi^ a , . . . , x m ,n; Ai 7ll , . . . , Ai^ a ) 
is true for all n sufficiently close to a (as quantified over the standard universe il) . 

Proof. See e.g. [5j Theorem A. 6]. □ 

Another fundamental property is that of countable saturation: 

Lemma 19 (Countable saturation). Let k be a standard natural number, and 
let X\, . . . , Xk be nonstandard spaces. For each standard natural number n, let 
P n (xi, . . . , Xk) be predicates defined for xi £ Xi, using some finite number of 
nonstandard objects and spaces as constants, and quantified over the nonstan- 
dard universe. Suppose that for any standard natural number N , there exists 
x\ G Xi, . . . , Xk G Xk such that P n (x\, . . . ,Xk) holds for all (standard) n < N. 
Then there exists x\ G X\, . . . ,Xk G Xk such that P n (x\, . . . ,Xk) holds for all 

neN. 



/( lim x n ) := lim f n (x n )- 



\X\ := lim |A n |. 




x£X 



x„ex, 
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Proof. Write X* = Jlm-a -^».n an d -fn = lim n ^oo -Pn,n- By Los's theorem, we see 
that for every standard natural number M, there exists an a-large set Em and ele- 
ments Xi tIlt M G X itI1 for n e Em and i — 1, . . . , k such that P n , n (^i,n,Mj ■ • • 5 £fe,n,&f) 
holds for all n £ Em and n = 1, . . . ,M. By shrinking the i?M if necessary, we 
may assume that the Em are non- increasing in M. We then define M n for any 
n G Ei to be the largest integer in {1, . . . , n} for which n G £m„- If we then set 
Xi := lim n _> Q Xi iIlt M a for i = 1, . . . , k, we see from Los's theorem that x, € X$ for 
all i = 1, . . . , k, and P„(xi, . . . , Xfc) holds for all standard n, as required. □ 

A typical application of countable saturation is the following: if / : X — »• *R is 
a nonstandard function with the property that /(x) is bounded for every x G X, 
then there is a uniform bound |/(x)| < M for some standard M (for otherwise the 
predicates |/(x)| > n would form a counterexample to Lemma This automatic 
uniformity is one advantage of the nonstandard framework: one does not need 
to make as many careful distinctions between the order of various quantifiers in 
one's arguments (but one instead has to carefully distinguish between standard 
and nonstandard quantities). In this paper we will not use countable saturation 
very often, however, because in our applications such uniform bounds are also often 
obtainable directly from algebraic geometry methods (e.g. bounding the degree of 
various varieties or maps). 

We will be working extensively with nonstandard finite fields F = Jln^o F n 
in this paper, which are examples of what are known as pseudo-finite fields in 
the model theory literature, because (by Los's theorem) they obey all the first- 
order sentences in the language of fields that hold for all finite fields. By Los's 
theorem, the algebraic closure F of a nonstandard finite field F is contained in 
the ultraproduct of the algebraic closures F n of the associated finite fields; indeed, 
it is the space of ultralimits lim n _> Q x n , where for all n sufficiently close to a, 
x n lies in an extension of F n of degree at most C for some C independent of 
n. The nonstandard finite field F has a nonstandard Frobenius endomorphism 
Frobp : F — > F, defined as the restriction to F ultralimit of the standard Frobenius 
endomorphism FrobF„ : F n — > F n defined by 

Frobp n (x n ) := x|f='. 

(Note that FrobF„ preserves every finite extension of F n , and so Frobp is well- 
defined on F.) Note (again by Los's theorem) that F can be viewed as the set of 
fixed points of Frobp in F, since F n is the set of fixed points of Frobp n in F n for all 
n. Later on, we will use this Frobenius endomorphism to determine which varieties 
over F are actually defined over F. 

Ultraproducts interact well with definable sets. From Los's theorem, we see that 
if F = Iln^Q ^ n is a nonstandard field and d is a standard natural number, then a 
set E C F d is definable over F if and only if E can be expressed as an ultraproduct 
E = Iln^Q ^n, where for all n sufficiently close to a, E n C F^ is definable over 
F„ with complexity at most M, for some M independent of n. 

In a similar vein, a function P : F — > F on a nonstandard field F = Iln^a F n 
is an externa^^ polynomial (that is, a polynomial in the usual sense) if and only 



Taking the ultralimit of polynomials whose degree goes to infinity instead of being uniformly 
bounded will lead to a function which is a nonstandard polynomial, but not an external one. For 
instance, the nonstandard Frobenius endomorphism Frobp is a nonstandard polynomial, but is 
not external if the characteristic of the F n goes to infinity. 
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if it is an ultralimit P = lini n ^Q, P n of (standard) polynomials P n ■ F n — > F n of 
uniformly bounded degree. 

We can now give the nonstandard version of the main theorems stated in the 
introduction. We first give the nonstandard version of Theorem [TJ 

Theorem 20 (Moderate asymmetric expansion, nonstandard formulation). Let F 

be a nonstandard finite field of (external) characteristic zero, and let P : F x F — > F 
be an (external) polynomial. Then at least one of the following statements hold: 

(i) (Additive structure) One has 

P{x 1 ,x 2 ) = Q{F 1 {x l ) + F 2 {x 2 )) 

(as a polynomial identity in the indeterminates X\,x 2 ) for some (external) 
polynomials Q,F\,F 2 : F — >• F. 

(ii) (Multiplicative structure) One has 

P(xt,x 2 ) = Q(F 1 (x 1 )F 2 (x 2 )) 

for some (external) polynomials Q, F\, F 2 : F — > F. 

(iii) (Moderate asymmetric expansion) One has 

\P{A 1: A 2 )\ » |F| 

whenever A\, A 2 are nonstandard subsets of F with \A\\\A 2 \ ^> |F| 2-1 / 8 . 

Let us now see why Theorem [20] implies Theorem [T] Suppose for contradiction 
that Theorem l20l was true, but Theorem Q] failed. Carefully negating the quantifiers 
(and using the axiom of choice), we conclude that there is a standard natural 
number d such that, for every standard natural number n, one can find a finite 
field F n of characteristic at least n, and a polynomial P n : F n x F n — > F n of degree 
at most d, such that P n is not expressible in the form (J3)) or (|4]) for any polynomials 
Qn, -Pi, id F 2j a, and such that there exist subsets A\ yIl , A 2 ^ of F n with 

\Ax, a \\A 2 , n \ >n\F n \ 2 -^ s 

but 

\P n {Ai^A 2 ^)\ ^n-^Pnl. 
We now take ultralimits, giving the nonstandard finite field F := Jln^a F n with 
nonstandard subsets A; := Jln^a Ai, n for i = 1, 2 and the map P := linin^Q P n . 
For any standard natural number k, F n has characteristic greater than k for all 
but finitely many n, so by Los's theorem, F does not have characteristic k for any 
positive fc, and thus has characteristic zero. Because the P n are polynomials of 
degree at most d, P is an (external) polynomial also. From Los's theorem, we have 
|-4i | \A 2 1 3§> |F| 2-1 / 18 and |P(Ai, A 2 )\ |F|, and P cannot be expressed in either 
of the two forms Q , ((4]) . This gives a counterexample to Theorem [20l and the 
claim follows. 

It is also not difficult to show that Theorem Q] implies Theorem [201 but we will 
not need this implication here and so will leave it to the interested reader. 

In a similar vein, Theorem [2] and Theorem [3] follow from these nonstandard 
counterparts: 

Theorem 21 (Weak expansion, nonstandard formulation). Let F be a nonstandard 
finite field of (external) characteristic zero, and let P : F x F — > F be an (external) 
polynomial. Then at least one of the following statements hold: 
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(i) (Additive structure) One has 

P(x u x 2 ) = Q(aF(xi) + bF(x 2 )) 

for some polynomials Q, F : F — > F, and some elements a, b G F. 

(ii) (Multiplicative structure) One has 

P(x u x 2 )=Q(F(x 1 ) a F(x 2 ) b ) 

for some polynomials Q, F : F —¥ F, and some standard natural numbers 
a, b. 

(iii) (Weak expansion) One has 

\P(A,A)\ » IFI 1 / 2 ^! 1 / 2 
whenever A C F is a nonstandard subset with \A\ 3S> | F 1 1 1 i / is _ 

Theorem 22 (Almost strong asymmetric expansion, nonstandard formulation). 

Let F be a nonstandard finite field of (external) characteristic zero, and let P : F x 
F — > F be an (external) polynomial. Then at least one of the following statements 
hold: 

(i) (Additive structure) One has 

P{x 1 ,x 2 )=Q{F 1 {x 1 )+F 2 {x 2 )) 

for some polynomials Q, F\,F 2 . 

(ii) (Multiplicative structure) One has 

P{xi,x 2 ) = Q{Fx{xx)F 2 {x 2 )) 

for some polynomials Q, F±, F 2 . 

(iii) (Algebraic constraint) One has irreducible affine curves V,W defined over 
F and the constraint 

P{f(x 1 ),g{x 2 )) = h{Q{x 1 ,x 2 )) 

for all x\ G V, x 2 <E W and some polynomials f : V — > F, g : W — > F, 
h : F — > F, Q : V xW — >F defined over F, with f,g non-constant and h 
having degree at least two. 

(iv) (Almost strong asymmetric expansion) One has 

iFWi^ 2 )i«iFi (^y 1/2 

whenever A\,A 2 are non-empty nonstandard subsets ofF. 

The derivations of Theorem [2 [3] from Theorem [2lJ [22] are closely analogous to 
the derivation of Theorem [T] from Theorem [20] and are omitted. 
Finally, Lemma [5] also follows from a nonstandard counterpart: 

Lemma 23 (Algebraic regularity lemma, nonstandard formulation). Let F be a 

nonstandard finite field of (external) characteristic zero, let V,W be non-empty 
definable subsets over F , and let E C V x W be another definable set. Then there 
exists partitions V = V\ U . . . U V a , W — W\ U . . . U Wb into a (standard) finite 
number of definable sets, with the following properties: 

• (Largeness) For all i G {1, . . . , a} and j G {1, . . . , b}, one has \Vi\ 3> |V|/C 
and \Wj\ > \W\/C. 
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a 1/4 -regularity) For all £ {1, . . . , a} X {1, . . . , b}\I , and all A C 

Vi, B C Wj, one has 



\\En(AxB)\-d t3 \A\\B\\«\F\-^\V t \\WA 

where dy := lE ^^ j)l . 

Again, we omit the derivation, as it is closely analogous to the previous deriva- 
tions. 

It remains to establish Theorem BUI Theorem [521 Theorem [2T] and Lemma 1231 
This will be the focus of the remainder of the paper, with the latter lemma being 
used as a crucial tool to prove the first three theorems. 

As mentioned previously, there are two main reasons why we move to a nonstan- 
dard framework. The first is that one no longer has to explicitly keep track of the 
complexity of various definable sets or algebraic varieties that one will shortly en- 
counter in the argument. This allows one to use many existing results from algebraic 
geometry without modification, as these results are usually phrased qualitatively 
rather than quantitatively, and so do not come with explicit bounds on complexity. 
The other reason is that now that we have passed to a field of characteristic zero, 
many aspects of algebraic geometry become simpler; in particular, varieties are 
generically smooth, and etale fundamental groups are topologically finitely gener- 
ated. Furthermore, we can take advantage of embeddings into the complex field C 
(Lefschctz principle) in order to exploit the theory of Riemann surfaces. It would 
be rather difficult (though not entirely impossible) to replicate these facts in the 
original setting of finite fields of large characteristic. 



4. Definable sets and Lang- Weil type bounds 

In order to prove the algebraic regularity lemma, we will need some results 
in the model theory literature [2J, jJD], [2TJ, [5], [S], [I3J on definable subsets of 
nonstandard finite fields. 

We will need the fact that definability over a nonstandard finite field F can be 
detected using the Frobenius map: 

Lemma 24. Let F be a nonstandard finite field, with algebraic closure F, and let 
FcF be a quasiprojective variety. Then V is defined over F if and only if it is 
invariant with respect to the action of the nonstandard Frobenius map Frobp ( which 

—n . 

acts componentwise on F J. 

Proof. The "only if" part is clear, so we focus on the "if" part. First suppose 
that V is an affine variety that is invariant under FrobF- Then the ideal I(V) of 
polynomials that vanish on V is also FrobF-invariant (where FrobF acts on each 
coefficient of a given polynomial separately). If we let Pi, . . . ,P m be the reduced 
Grobner basis of I(V) with respect to lexicographical ordering (see e.g. [13]), then 
this basis is unique, and is thus also FrobF-invariant, that is to say the coefficients 
of the P), lie in F. Thus V is defined over F as claimed. 

Now suppose V is a quasiprojective variety that is invariant under FrobF- The 
Zariski closure V of V in F is then also invariant under FrobF, and by the preceding 
discussion is thus defined over F; similarly for the affine variety V\V. The claim 
follows. □ 
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We will also need the fact that definable sets over nonstandard finite fields are 
projections of the F-points of varieties, and their cardinality is comparable to a 
power of |F|. More precisely, we have 

Theorem 25 (Almost quantifier elimination). Let F be a nonstandard finite field, 
and let E be a subset of F" for some (standard) natural number n. Then E is a 
definable set if and only if it can be expressed as the intersection of finitely many 
sets, each of the form 

(13) {x G F" | 3t G F : P(x, t) = 0} 

for some polynomial P : F™ x F — > F with coefficients in F. 

Furthermore, if E is definable, the Zariski closure E of E in F is the union 
of finitely many geometrically irreducible affine varieties defined over F, and the 
nonstandard cardinality \E\ of E is given by 

(14) \E\ = (a + 0(|Fr 1/2 ))|F| dim(:g) 

for some standard positive rational number a (with the convention that |F|~°° = 0). 
In particular, we have 

(15) | F |dim(S) < |£| < | F |dim(E)_ 

Proof. These are the main results of [9] . The fact that E consists only of varieties 
defined over F follows from Lemma [2U since E and hence E is Frobenius-invariant. 
The fact that the exponent of F is the dimension of the Zariski closure of E is [9j 
Proposition 4.9]. □ 

We illustrate the almost quantifier elimination (|25[) with some simple examples. 
The space of quadratic residues of F that include can be expressed as 

{x G F | 3t G F : x - t 2 = 0}, 
while the set of non-zero elements of F can be expressed as 

{x G F | 3t G F : xt - 1 = 0}, 
the singleton set {0} can be expressed as 

{x G F | 3t G F : x = 0}, 
and the set of nonquadratic residues of F (again including 0) can be expressed as 

{x G F | 3t G F : ax - t 2 = 0}, 

where a is an invertible quadratic non-residue. By intersecting these sets together, 
we can then create other definable sets, such as the set of all non-zero quadratic 
residues. These examples also show that the quantity a appearing in (Ti"4"]) need not 
be an integer (for instance, in the case of quadratic residues, a is 1/2 when the 
characteristic is not equal to 2). 

Theorem l25l gives the order of magnitude on the cardinality \E\ of a definable 
set, but does not specify exactly what the rational constant a in the asymptotic 
(|14j) . The following bound computes this constant in the case of a quasiprojective 
variety. 

Lemma 26 (Lang- Weil bound). Let F be a nonstandard finite field, and let V C F™ 
be a quasiprojective variety defined over F. Then one has 

\V(F)\ = (c + 0(|Fr 1/2 ))|F| dim(y) 
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where c are the number of top- dimensional geometrically irreducible components of 
V (i.e. components of dimension exactly dim(V)) which are defined over F. 

Proof. By Theorem [25] (or by cruder estimates, such as [44] Lemma 1]), any affinc 
variety of dimension strictly less than dim(V) has at most 0(|F| dim ( v )-i) F-points. 
Thus, by replacing V by its Zariski closure and then removing all lower dimensional 
components, we may assume without loss of generality that V is a geometrically 
irreducible affine variety. 

If V is not defined over F, then by Lemma [24] Frobr(^) is a different variety 
from V, and so V H FrobF<V) has dimension strictly less than dim(V). But this 
variety contains all the F points of F, and so V only has at most 0(|F| dim (^ -1 ) 
F-points, and the claim follows in this case (with c = 0). 

Finally, if V is geometrically irreducible and defined over F, the claim follows 
from gU Theorem 1]. □ 

5. Proof of regularity lemma 

We now prove Lemma [23] The first step is to pass from the set £ to a more 
tractable counting function (essentially the "square" of E), as follows. 

Proposition 27 (First reduction). LetF be a nonstandard finite field of character- 
istic zero, let V, W be definable sets over F with V, W geometrically irreducible, and 
let E be a definable subset ofVx W. Let \i : W x W — > *N denote the nonstandard 
counting function defined by the formula 

(16) (i(w, w') :=\{v€ 7(F) | (v, w), (v, w') e E}\ 

for all w,w' € W. Then one can partition W into a (standard) finite number of 
definable subsets W\, . . . , W m such that, for any 1 < i,j < m, there is a standard 
rational c y such that n{w,w') = (c tJ + OQFl- 1 / 2 ))^] for all but 0{\F\~ l / 2 \W\ 2 ) 
of the pairs (w,w ! ) € Wi x W . 

Let us now see how Proposition l27l implics Lcmma [2"3"l This will be an application 
of the "TT* method" from harmonic analysis. Let F, V, W, E be as in Lemma 
|2"3"1 By decomposition we may take V, W to be geometrically irreducible. We use 
Proposition[57]to partition W into finitely many definable components Wi with the 
stated properties. Observe that at least one of the Wi must have a Zariski closure 
of dimension dim(VK) (and thus equal to W, by the irreducibility of W); and if any 
component has Zariski closure with dimension strictly less than dim(M / ), it may be 
safely absorbed into one of the other components without affecting the conclusion 
of the proposition (thanks to (fTS"]) ). Thus we may assume that all Wi have the same 
Zariski closure as W, which among other things implies that \Wi\ S> \W\, thanks 
to (|15j) . Next, by passing to just one of these components, we may assume that 
m = 1, thus we may reduce without loss of generality to the case where 

(17) n(w,w') = (c+0(\F\- l / 2 ))\V\ 

for all but 0(|F| _1 |W^| 2 ) pairs (w, w') £ W x W, and some standard rational c. We 
can of course assume that W is non-empty, as the claim is vacuously true otherwise. 

Now let / : W — > *M be any nonstandard function of mean zero and bounded in 
magnitude by 1, and consider the nonstandard sum 

£ I £ l E (v,w)f(w)\ 2 . 
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We may rewrite this expression as 

E f(w)f(w')n(w,w'). 

Applying (Q2]), we have fi(w,w') = (c + C>( |F | - 1 / 2 ) ) | for all but 0(\F\-y 2 \W\ 2 ) 
pairs (w,w'). For these exceptional pairs, we use the crude estimate fi(w,w') — 
0(\V\) = (c + 0(l))\V\. We conclude that 

£ | £ l E(v,w)f{w)\ 2 = £ fHf^')c + 0(\F\-^\V\\W\ 2 ). 

But as / was assumed to have mean zero, the first sum vanishes, and so 

£ | J2 l E (v,w)f( W )\ 2 ^\F\-^\V\\W\ 2 . 
vev(F) w ^ w 

In particular 

£i vj i £ ( V;U ;)/Hi 2 «|Fr i /2|y||iy| 2 . 

v£V w£W 

Next, we apply Proposition [27] again, but with the roles of V and W reversed, to 
partition V into finitely many definable components Vi, . . . , such that, for any 
1 < 5: m i there is a standard rational such that 

\{w G W | («,«;), («',«) G £7}| = (cy + 0(|F|-V 2 ))|^| 

for all (v, v') G Vi x V} outside of a subvariety of V x V of dimension strictly less than 
2dim(V). As before, we may assume that each Vi has Zariski closure of dimension 
dim(V), so that \V%\ S> \V\ thanks to (fTSl) . By arguing as above, we conclude that 

£ \^2l E (v,w) 9 (v)\ 2 ^\F\-^ 2 \V\ 2 \W\ 

whenever 1 < i < m and g : Vi — > *K is a nonstandard function of mean zero 
bounded in magnitude by 1. By Cauchy-Schwarz, we conclude that 

| £ £ 9(v)f(w)l E (v,w)\ « \f\-v*\v\\w\ 

v£Vi wGW 

whenever 1 < i < m and / : W — > *K, g : Vi — > *K are nonstandard functions 
bounded in magnitude by 1, with at least one of /, g having mean zero. If we now 
let A, B be arbitrary nonstandard subsets of Vi, W respectively, we can decompose 
1a into a constant component |A|/|V5| and a mean zero component 1a — \A\/\Vi\, 
and similarly decompose 1b into |B|/|W| and 1b — \B\/\W\; applying the above 
estimate to three of the four resulting terms, we conclude that 

£ £ U{v)l B {w)l E {v,w)=e i \A\\B\ + 0{\F\- 1 ' i \V\\W\) 

where 6, := \E\/\V\\W\ is the density of E in Vi X W. Note from Theorem [25] that 
6i lies within 0(|F| -1 / 2 ) of a standard rational, and by replacing A, B with Vi, W 
we see that B l lies within 0(\F\-^ 4 ) of \E D (V x W)|/|V;-||W|. Lemma [231 follows. 

It remains to establish Proposition [27l To do this, we will (after some basic 
reductions) use the Lang- Weil bound fLemma !26p to compute fJ,(w, w') in terms of a 
counting function c(w, w') that counts the number of top-dimensional geometrically 
irreducible components of a certain variety U w XyU' w , that are defined over F. 
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We turn to the details. Observe that any component E' of E that lies in a proper 
subvariety of V x W has cardinality at most 0(|F| _1 |y||W|) by Theorem [23 By 
Chebyshev's inequality, we thus see that 

\{veV(F)\(v,w)€E'}\ = 0(\F\- l / 2 \V\) 

for all but 0( |F | — 1//2 1 T^K | ) elements w G W. From this, we see that E' has a 
negligible impact on the conclusions of Theorem 1271 Thus we may freely delete 
any strict subvariety of V x W from E if we wish (i.e. we may work with generic 
subsets of V x W). 

By Theorem I25[ we may write E in the form 

( 18 ) 

E = {(v,w) G 7(F) x V(F) | 3*i,..., t m G F : P l {v 1 w,t i ) = for all i = 1, . . . ,m} 

for some finite collection of polynomials Pi, ... , P m lyxlfxF^r defined over 
F. For any one of these polynomials P,, consider the set of pairs (v,w) for which 
the one-dimensional polynomial ti H ► Pi(v,w,ti) vanishes. This is a subvariety of 
V x W defined over F. If it is all of V x W, then the polynomial Pi is redundant in 
(fTS)) and can be deleted, so we may assume that it is a strict subvariety of V x W. 
Thus, by deleting all such varieties as discussed previously, we may assume that E 
actually takes the form 

(19) E = {(v,w) € Q(F) | 3*i, . . .,t m G F : Pi(v,w,ti) = for alH = l,...,m} 

where O is a Zariski-dense subvariety of V x W defined over F, and the polynomials 
U i y Pi{v, w, ti) are non- vanishing for any (v, w) £ £1. If we then define 

U := {(v, w,ti, . . . ,t m ) eOx W 71 I Pi(v, w,U) = for alH = 1, . . . ,m} 

then U is a quasiprojective subvariety of £1 x F m defined over F, and we have 

(20) E = it(U(F)) 

where tt : fix F m — ^ S7 is the projection map, which is quasi-finite in the sense that 
the fibres ir^ 1 ({(v,w)}) of tt are finite (hence zero dimensional) for all (v,w) E O. 
In particular, U has dimension at most dim(y) + dim(W), thanks to Lemma [12] 
For any w £ W, we may form the quasiprojective variety 

u w :={(«,*) e v x r | («, w, t) e u}, 

and for any w, w' G W we may then form the fibre product 

U w x v U w , :={(v,t,t') eFxf xf | («,t) G U w ,(v,t') G {/„/}. 

These are quasiprojective varieties which are quasi- finite over V and so have di- 
mension at most dim(V r ). We form the counting functions 

*W(«) := \{(t,t') G F m x F m | (v,t,t') G U w x v U w ,}\. 

Then ^^'(w) is finite for all v G V(F), and thus (by countable saturatiorf^l 
see Lemma I19[) is uniformly bounded. On the other hand, from (|20[) we have 
v w ,w'(v) 7^ if and only if (v,w), (v,w') G E. Thus by (fTl)]) . we have 

H(w,w')= ^2 1 ^,„'(")#o- 
»e7(F) 



^^One can also use bounds on the degrees of the algebraic varieties involved here, if desired. 
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As v w ,w'(v) is uniformly bounded, we can express l Cjo w ,(v)^o as a standard linear 
combination of v w ,w'('v) k for finitely many standard natural numbers k, and so 
H(w, w') is a standard linear combination of the moments X^eVYF) v w,w' (v) k - Thus, 
to prove Proposition [571 it suffices (by Theorem [55)1 to show that for each standard 
natural number k, we can partition W into a (standard) hnite number of definable 
subsets Wi, . . . , W m such that, for any 1 < i,j < m, there is a standard rational 
Cijk such that 

£ v w , w ,(v) k = (c ijk +0(\F\-V 2 ))\F\ d ™W 
neF(F) 

for all but 0(\F\-^ 2 \W\ 2 ) of the pairs (w,w') € Wi X Wj. 
Observe that 

where U w .k is the fc-fold fibre product of U w over V, 

U w , k ~{(v,h,...,t k ) eVx (F m ) fe | (v,w,U) e t/ for alH = l,...,fc}, 

and 

U Wtk x V U w >,k ■= {(v,t,t') eV x (F m ) k x (F™) fc | (v,t) e l7 Wjfc) (*;,*') e U w ,, k }. 
By Lemma we conclude that 

J2 »™Mv) k = (c^^o + odFr 1 / 2 ))^!^) 

where c(w, u>') are the number of geometrically irreducible components of U Wt k Xy 
U w >,k which are defined over F. To establish Proposition [27j it thus suffices to 
establish the following claim: 

Proposition 28 (Second reduction). Let F be a nonstandard finite field of charac- 
teristic zero, and let V, W, W be definable sets over F. Let m be a standard natural 
number, let U be a subvariety of V x W x F defined over F which is quasi-finite 
over V x W , and let U' be a subvariety of ' V x W x F defined over F which is 
quasi-finite over V x W. For any w £ W and w' £ W, set 

U w :={(v,t) eVxF m \(v,w,t)e U}, 
U' w , :={(v,t')EVxT n \(v,w',t')EU'} 

and 

U w x v U' w , :={(v,t,t') eVxF m x F™ | (v,t) e U w ,(v,t') € U' w ,} 

and let c(w, w') be the number of dim(V) -dimensional geometrically irreducible com- 
ponents of U w XyU' w i that are defined over F. Then one can partition W into a 
(standard) finite number of definable subsets Wi, . . . , W m and W' into a (standard) 
finite number of definable subsets W[, . . . , W m , such that, for any 1 < i < m and 
1 < i' < to', there is a standard natural number Cn* such that c(w,w') = en' for all 
but OdF^^lWWW'l) of the pairs (w,w') £W l xW' l ,. 
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Indeed, after replacing U and U' with the /c-fold fibre product 

U k :={(v,w,ti,...,t k ) &nx (F m ) k I (v,w,U) e U for all i = 1, . . . , fc} 

and applying the above proposition once for each k (and with W = W), we ob- 
tain the required structural decomposition of Y^veV v w,w'{v) k (after intersecting 
together all the partitions obtained). 

It remains to establish Proposition 1281 The next reduction is to remove the re- 
quirement that the sets Wi,..., W m and W[, . . . , W' m , in the partitions of W, W 
are themselves definable or even nonstandard, at the slight cost of upgrading the 
0(|F|-V2|W1|W'|) error to C( |F | - 1 1 | W|) ■ In other words, we will deduce 
Proposition |2"51 from the following assertion. 

Proposition 29 (Third reduction). Let F, V, W, W'm, U, U', c be as in Proposi- 
tion \21A Then one can partition W into a (standard) finite number of subsets 
Wi, . . . , W m , and W into a (standard) finite number of subsets W{, . . . , W' m ,, such 
that, for any 1 < i < m and 1 < %' < m! , there is a standard natural number en* such 
that c(w,w') = c u > for a/0 but O^F^ 1 \W\ \W'\) of the pairs (w,w') eWiXW{,. 

The freedom to allow the partitions of W, W to not be definable or nonstandard 
will be of technical importance later in the argument, when we will use the axiom 
of choice to force a "coordinate system" on various relevant objects needed to 
compute c(w, w'); such coordinate systems will not necessarily be "definable" or 
even "nonstandard" , but thanks to the above reduction, this will not be an issue. 

Let us assume Proposition [55] for now and see how it implies Proposition [55] 
The key observation is that the level sets {(w,w') € W x W \ c{w,w r ) = c } of 
the function c are themselves definable subsets over F, as the property of a set 
cut out by a number of polynomial equations being geometrically irreducible and 
definable over F can be expressed as a first-order sentence in the coefficients of 
these equations. (Note from countable saturation that that the complexities of all 
the irreducible varieties involved in a decomposition of a given variety of bounded 
complexity is necessarily bounded, and so the first-order sentence involved is finite 
in length.) Next, we apply Propositionl29l to partition W and W' into finitely many 
pieces Wx,...,W m and W[, . . . jW^,, not necessarily definable or nonstandard. 
By hypothesis, we can find a nonstandard subset £ of W x W' of cardinality 
0( |F | ~ 1 1 1^ 1 1 | ) with the property that whenever 1 < i < to, 1 < i' < m' and 
(w,w') e {Wi x W{,)\E, we have c(w,w') = c li >. 

By Markov's inequality, we see that outside of a exceptional subset E' of W' of 
cardinality 0{\F\'^ 2 \W'\), we have (w, w') <£ E for all but 0(|F| - 1 / 2 ! W|) elements 
of w 6 W. For any 1 < i' < to, we set w'^ to be an arbitrarily chosen element of 
W[i \E' if this set is non-empty, or an arbitrarily chosen element of W otherwise. 
By construction, we see that for all w' £ W'\E' , there exists 1 < i' < m such that 

c(w, w') — c(w, w^,) 

for 

c(w, w'^) — c{w, w') 



Because Wi and Wi, are not assumed to be nonstandard sets, one has to be careful about 
what this means, since Wi and W' it need not have a well-defined cardinality. What we mean 
here is that the set of exceptions (w,w') £ Wi X W!, for which c(w,w') ^ c u i has an outer 
cardinality of 0(|F| _1 \W\ \W' |), in the sense that it is contained in a nonstandard set of cardinality 
OOFI^IWHWI). 
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for all but OGFI- 1 / 2 !^!) values of w G W. 

For each 1 < i' < ml and natural number Co, the level sets {w G W \ c(w, w'^) = 
Co} is a definable set. These definable sets generate a partition of W into finitely 
many definable subsets W\ , Wm . By construction, we see that for all but 
0(|F| -1 / 2 |iy |) values of w' G W, the function w i-> c{w,w') is constant out- 
side of a set of cardinality 0(|F| _1 / 2 |J^|) n each of the W\, . . . , Wfh- By sym- 
metry, we may also partition W into finitely many definable subsets W{ , . . . , W', 
with the property that for all but 0(|F|~ 1//2 |"H^|) values of w G W, the function 
w' i — ^ c(w,w') is constant outside of a set of cardinality 0(|F| — 1//2 |T4^'|) on each of 
the Wi,...,W^,. 

By Thcorem[25l the definable sets W either have cardinality » or 0(|F| — 1 1 
Any sets of the latter form can be harmlessly absorbed into one of the sets of the 
former form, so we may assume that all sets Wi have cardinality | W | . Similarly 
we may assume that all the sets W' v have cardinality ^> \W'\. 

Now we double-count, for any f < i < m and 1 < i' < m', we see that 

\{(w u w 2 ,w[,w' 2 ) g WtxWixw^xwl, | cK^D^cK,™;)}! « iFr 1 / 2 !^! 2 ^'! 2 

(because of the constancy properties of w H > c(w,w' 1 ) for most and 
\{(w 1 ,w 2 ,w' 1 ,w 2 ) G fjXWiX^x^ I c{w 2 ,w[) ^ c(w 2 ,w' 2 )}\ « |F| - 1 / 2 \W | 2 |W | 2 
(because of the constancy properties of w' i— > c(u>2, u/) for most W2) and thus 

Iftitfi.ufcX.t^) G W[,xW' v xW^,xW^ I cfaiX) = c(«; 2 Xi) = c(u, 2 ,™ 2 )}| = ^{W^-O^ 1 / 2 ^) 
and thus by the pigeonhole principle we can find w 2 , w' 2 such that 
|{(«;iXi) e^iX^I c(ibiXi) = cK,w 2 )}| = I Will W,' I - OdFl-^I^HWI) 

and so c is constant on Wi x W/, outside of a set of cardinality 0(|F| -1 / 2 |W| |W'|), 
and the claim follows. 

It remains to establish Proposition [29l We can now remove all references to 
definability by passing to Zariski closures, and reduce to establishing the following 
fact: 

Proposition 30 (Fourth reduction). LetF be a nonstandard finite field of charac- 
teristic zero, and let V, W, W' be affine varieties defined over F. Let d be a natural 
number, and let U, U' be subvarieties ofVxWxF and V x W' x F respectively 
which are defined over F and quasi-finite over V x W and V x W' respectively. For 
any w G W and w' G W' , set 

U w :={(v,t)eV xF d \{v,w,t)eU}, 

U' w , :={(v,t')eVxF d \(v,w',t')eU'} 

U w x v U' w , := {(v,t,t') G V x F d x F* : (v,t) G U w , (v,t r ) G U' w ,}, 

and let c(w, w') be the number of dim(V) -dimensional geometrically irreducible com- 
ponents of U w XyU' w , that are defined over F. Then one can partition W into a 
(standard) finite number of subsets W\, . . . , W m (not necessarily definable or non- 
standard) and W' into a (standard) finite number of subsets W{, . . . , W' m , such 
that, for any 1 < i < m and 1 < i' < m! , the function c is generically constant 
on Wi x W-, (i.e. it is constant in Wi x W[, outside of a subvariety ofWx W' of 
dimension strictly smaller than dim(W) + dim(H // ) ). 
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Indeed, Proposition [29] follows from Proposition [30] by specialising to a defin- 
able subset over F and using Lemma [26] to control the (outer cardinality of the) 
exceptional set. 

Now we prove Proposition 1301 Our strategy is to work generically and improve 
the nature of the varieties U Wl U' w , lying above V, until they become finite etale 
covers of certain Zariski-dense subvarieties of V. At that point, we can use the 
theory of the etale fundamental group (Appendix [A| to obtain the required local 
generic constancy of the counting function c. 

We turn to the details. First, we may decompose V into geometrically irreducible 
components. Any component which has dimension less than dim(y), or which is 
not defined over F, gives a zero contribution to c. Thus we may discard these 
components, and reduce to the case when V is a single geometrically irreducible 
affine variety defined over F. For similar reasons, we may also reduce to the case 
where W, W are geometrically irreducible affine varieties defined over F. 

Next, we observe that we may freely delete any closed subvariety from U of 
dimension at most dim(V) + dim(W) — 1 without affecting the conclusion of the 
proposition. Indeed, for generic w G W, this deletion will only remove a set of 
dimension at most dim(y) — 1 from U w and hence from U w x v U' w , for any w' € W , 
and hence will not affect c(w, w') for generic w. Similarly, we may delete any closed 
subvariety from U' of dimension at most dim(V) + dim(W) — 1. 

Next, we work to make U smooth. Given an affine variety V C k n and a point 
p in V, define the tangent space T p V of V at p to be the vector space m/m 2 , 
where m is the space of polynomials in k[V] that vanish at p. We say that p is a 
smooth point of V if T p V has dimension dim(V), and a singular point otherwise. 
A quasiprojective variety U is said to be smooth if every point of U is a smooth 
point of U. Note that a point that lies in two or more components of an affine 
variety cannot be a smooth point of that variety, so the irreducible components of 
a smooth quasiprojective variety are necessarily disjoint. 

The variety U has dimension at most dim(V) + dim(W). As is well known, the 
set of singular points of U must have dimension strictly less than this (see e.g. [Ml 
Theorem 5.6.8]); here is one place where we crucially use the hypothesis that F 
has characteristic zero. By deleting these points, we may thus assume that U is 
smooth and has dimension exactly dim(V) + dim(W); in particular, the irreducible 
components of U are now disjoint. Similarly, we may assume that U' is smooth 
and has dimension dim(V) + dim(W'). In particular, the projections of U and U' 
to V x W and V x W' respectively are now dominant maps, in the sense that their 
images are Zariski dense. 

By again using the hypothesis that F has characteristic zero, the set of points 
u G U where the derivative dir(u) of the projection map 7r : U —> V x W does not 
have full rank, has dimension strictly less than dim(F) + dim(W^) (see e.g. [35] 
III 10.7]), so by deleting these points we may assume that dcf> is everywhere non- 
singular, or in other words that 7r : U — > V x W is an etale map. Similarly, we may 
assume that the projection ir' : U' — > V x W' is also etale. 

The projections tt, tt' are currently quasi-finite and etale. We will need to upgrade 
the quasi-finiteness property to the stronger property of finiteness. We quickly 
review the relevant definitions: 

Definition 31. Let V C k n be quasiprojective variety over an algebraically closed 
field k. A quasiprojective variety is abstractly affine if there is a regular isomorphism 
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between it and an affine variety. A regular morphism <j) : V — > W is finite if one 
can cover W by open, abstractly affine subvarieties Wi, such that (j)~ 1 (Wi) is also 
abstractly affine, and the ring fc[0" 1 (M /r i)] is a finite /c[Wi]-algebra (where we use <\> 
to pull k[Wi] back into fc[0 _1 (Wi)] in the obvious manner). 

Example 32. The inclusion of &\{0} into k is quasi-finite and etale, but not finite, 
because the ring fc[/c\{0}] = k[x, —] is not finite over k[k] = k[x]. A finite morphism 
in algebraic geometry is analogous to the notion of a covering space (with finite 
fibres) in topology; note for instance that the inclusion of C\{0} into C is also not 
a covering space. 

We have the following basic fact: 

Lemma 33. Let <j> : U — > V be a quasi-finite regular morphism between two 
quasiprojective varieties U, V which is dominant. Then there exists an open dense 
subvariety V' of V such that the restricted map 4> ■ §~ (V) —> V is finite. 

Proof. We adapt the proof of [SHI Theorem 1.5.3.6]. By passing to an open dense 
abstractly affine variety of V (such as V with a codimension one closed subvariety 
removed) we may assume that V is abstractly affine. Let k(V) denote the field of 
fractions of k[V] (i.e. the rational functions on V), and similarly define k(U). By 
the hypotheses on 0, k(U) is an algebraic extension of k{V) (pulled back by 0, of 
course), and so on clearing denominators one can find a finite set of generators for 
k[U] that become integral over k[V] after multiplying by a non-zero regular function 
/ in k\V\. Removing the zeroes of / from V (thus adding 1/f to k[V], and keeping 
V abstractly affine), and removing the corresponding preimage from U , we obtain 
the claim. □ 

Applying this lemma to the map tt : U — > V x W, we can remove a lower- 
dimensional piece from U and assume without loss of generality that tt : U — > tt(U) 
is not just etale, but is finite etale, thus the smooth variety U is a finite etale cover 
of tt(U). Similarly, we may make tt' : U' — >• tt'(U') a finite etale covering map. 

Since tt : U — > tt(U) is a finite etale covering map of smooth varieties, the 
restriction ny : U w — > ttv(U w ) is also a finite etale map of smooth varieties for 
any w £ W, where ny ■ V x F m — > V is the projection onto V. Similarly, ttv ■ 
~^ n v(U w ,) is finite etale for any w' £ W', which implies that the fibre product 
ttv ■ U w Xy U' w , — > 7Tv(U w xy U' w ,) is also finite etale. (Here we have used the 
fact that finiteness and the etale property are both preserved with respect to base 
change and composition; see e.g. [1H1 Propositions 1.1.3, 1.3.3].) 

The set <f>(U) is a Zariski-dense subvariety of V x W. Applying Lemma [T3l 
we conclude that for generic v £ V, one has (v,w) £ <f>(U) (or equivalently, v £ 
<t>v{Uw)) for generic w £ W. Similarly, for generic v £ V, one has v £ 4>v{U' w ,) for 
generic w' £ W. Thus, we may find a point p £ V such that p £ 4>v(U w xy U' w ,) = 
<t>v{Uw) n^y(J7^,) for generic w £ W and w' £ W . Indeed, by Lemma we may 
take p to be an F-point of V. 

Fix this point p. For generic w £ W and w 1 £ W, the fibre of of U w Xy U' w , 
over p is non-empty, and may be identified with the Cartesian products S w x S' w ,, 
where S w , S' w , are the finite sets 

S w :={t£F d \ {v,w,t)£U} 

and 

S' w , := {teF d | (v,w',t) eu'}. 
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As U, U' are defined over F, and p is an F-point of V, S w and S' w , are denned over 
F. In particular, the nonstandard Frobenius map FrobF acts on S w and S' w ,, and 
thus also acts on the product S w x S' w , by the diagonal action. 

As 4>v ■ U w xy U' w , — > 4>v(U w ) n <j>v{U' w ,) is a finite etale covering, the etale 
fundamental group ir\(<f>y(U w )^<l>v(U' w ,),p) acts on the fibre S w x S' w ,, by a product 
of its actions on the individual fibres S w and S' w , ; see Appendix As noted in 
that appendix, each orbit of this action is the fibre of exactly one of the irreducible 
components of U w Xy U' w ,. Thus, the number c(w,w') of such components that 
are defined over F is equal to the number of orbits of this action that are invariant 
with respect to the nonstandard Frobenius action. 

We view this number as a combinatorial quantity, which currently depends (for 
generic w, w') in a rather entangled fashion on several objects: the variety 4>v(U w )n 
<f>v(U w ,) (and more specifically, its etale fundamental group over p); the fibres S w 
and S' w ,; the action of the etale fundamental group on these fibres; and the action 
of the Frobenius map on these fibres. Our goal is to decouple the role of w and w' in 
forming c(w, w'), so that (after partitioning W and W into finitely many subsets, 
and working on a single subset of W and a single subset of W), the quantity 
c(w,w') becomes generically constant. 

Wc achieve this as follows. The first step is to (crudely) "trivialise the bundles" 
of U over V x W and U' over V x W' in a set-theoretic sense. Observe that 
for generic w, the fibre S w has constant cardinality M for some standard natural 
number M (indeed, M is just the degree of U divided by the degree of V x W). 
Thus, by the axiom of choice, we may enumerate S w = {t w i, . . . , ^,m}- By fixing 
such an enumeration, we can thus (non-canonically) identify S w with {I, . . . , M} 
for generic w. Similarly, we may non-canonically identify S' w , with {1, . . . , M'} for 
generic w' and some standard natural number M' by using the axiom of choice 
to select an enumeration S' w , = {t' w , 1 , . . . ,t' w , M ,}. Note that as we appeal to the 
axiom of choice here to build this enumeration, we do not claim or expect these 
identifications to be definable, or even nonstandard; but thanks to our reduction of 
Proposition 1281 to Proposition I29[ such definability and nonstandardness properties 
will not be needed^. 

We now fix the above enumerations of S w and S' w ,. For generic w, the action 
of the nonstandard Frobenius map FrobF on S w — {1, . . . , M} is now given by a 
permutation a w in the symmetric group Sym(M) on M elements. This permutation 
depends on w, so by partioning W into finitely many subsets (which need not be 
definable or nonstandard) , we can ensure that the map w i— > a w is constant on each 
such subset. We now pass to one of these subsets of W; thus for generic w in this 
subset, the action of FrobF on S w = {1, . . . , M} is now independent of w, when 
viewed in coordinates. Similarly, by partitioning W' into finitely many subsets and 
passing to any one of these subsets, we may assume that for generic w' in this 
subset, the action of Frobp on S' w , = {1, . . . , M'} is independent of w' . Thus, for 
generic w,w' in the respective subsets of W,W, the product action of FrobF on 

^One could avoid the appeal to the axiom of choice here by working with all enumerations at 
once, and quotienting out the objects constructed at the end of the day the equivalence relation 
given by all possible relabelings; similarly for some further invocations of the axiom of choice later 
in this argument. However, we will not choose this "coordinate-free" route here as it requires the 
use of more complicated notation, opting instead for a less elegant, but more direct "coordinate- 
heavy" approach, which is more unnatural from an algebraic geometry perspective, but more 
convenient from a combinatorial one. 
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S w x S' w , = {1, . . . , M} x {1, ... , M'} is independent of both w and w' . To obtain 
the desired local constancy of c(w, w'), it thus sufhces to show (perhaps after further 
finite partition of W and W) that for generic w,w' in their respective subsets of 
W,W, the set of orbits of the etale fundamental group TTi(<pv(U w ) D 4>v(U' w ,),p) 
on S w x S w > = {1, . . . , M} x {1, . . . , M'} is actually independent of the choice of w 
and w' . 

The main difficulty here, of course, is that the group 7Ti(0y {U w ) n <j>v{U' w ,),p) 
depends on both w and w' in a coupled fashion. To decouple the role of w and w' 
here, we would like to use the etale van Kampen theorem (Theorem [50]) , but first 
we must understand how the sets 4>v(U w ) and 4>v{U' w i) intersect each other. 

The set <j>(U) is an open dense subvariety of V x W, and can thus be written 
as 4>{U) = (V X W)\H for some closed subvariety of V x W of dimension at most 
dim(V) + dim(VF) - 1. We split E = (Eo X W) U Ei, where E x W is the union 
of all the irreducible components of E that are of the form H x W for some closed 
subvariety H of V, and Ei is the union of all the other irreducible components of 
E. Informally, Eo represents the portion of E that does not depend on the w G W 
coordinate, while Ei represents the portion which is non-trivially dependent on this 
coordinate. Note that Eo has dimension at most dim(T^) — 1, and Ei has dimension 
at most dim(V) + dim(VF) — 1. We then have 

<t>v(U w ) = F\(E U Ei, w ) 

for any w € W, where 

Si,™ ■■= {v &V\ (v,w) g Ex} 
is a slice of Ei. Similarly, we may write 

^(f/;o = n(s us' w ) 

for all w' € W, where E is a closed subvariety of V of dimension at most dim(F) — 
1, Ej is a closed subvariety of V x W 1 of dimension at most dim(V) + dim(M // ) — 1 
consisting entirely of components that are not of the form H x W' for any H , and 

E^, :={«eV|(t;X)eEi} 

is a slice of E' x . 
We now have 

Mu w ) n MK>) = V\02i, w u Ei t „,) 

where V is the open dense subvariety of V defined by 

V := n(So U E ). 

We can now apply the etale van Kampen theorem (Theorem [SHI) and conclude 
that for generic w,w', tti(4>v(U w ) n (f>v(U' w ,),p) surjects onto the fibre product of 
7ri(V'\Ei lW ,p) and TTi(V'\S' ltW ,,p) over m(V'\(^ w n Z' 1>w ,),p) (with respect to 
the obvious homomorphisms between these groups). 

Next, we make the crucial observation that for generic w, w' , the set Ei^nE^ , 
has dimension at most dim(y) — 2 (i.e. it has codimension at least 2 in V). Indeed, 
for generic w, Ei iTO has dimension at least dim(V^) — 1. Given a top dimensional 
component H w of Ei iUJ , H w will not be a component of E' x w , for generic w' unless 
H w x W' is contained in E' 1; which contradicts the construction of E' x . Thus the 
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intersection of H w with any component of Ej w , will generically have dimension at 
most dim(l / ) — 2, and the claim follow^. 

Because the codimension of E ljU , H E' x w , is generically at least 2, we may invoke 
Lemma [¥7] and conclude the isomorphism 

7ri(V^(Ei ltB nE' lit0 ,),p)=7ri(V',p) 

for generic w,w', using the obvious homomorphism from the former group to the 
latter. We conclude that for generic w,w', nx{<t>v(U w ) n <j>v(U' w i),p) surjects onto 
the fibre product of iri(V'\Y^i. Wl p) and 7ri(l / '\E' 1 w ,,p) over wi(V',p), with respect 
to the obvious homomorphisms between these groups. On the other hand, observe 
that 7ri(V'\Ei jU) ,p) acts on S w (as U w is a finite etale covering over V'\Ei )t „), 
and iri(V'\E' lw ,,p) acts on S' w ,, and so the fibre product of 7ri(V'\Ei )1t „p) and 
7Ti(V r '\E' 1 w ,,p) over tvi(V',p) acts on S w x 5^,, by the product action. From the 
compatibility of the etale fundamental group actions on fibres (see Appendix |Aj 
we see that the action of n 1 ((pv(U w ) n (f>v(U' w t),p) on S w x S' w , factors through this 
product action. From the surjectivity mentioned earlier, we conclude an important 
fact: 

Proposition 34. For generic w,w' , the set of orbits of ni(4>v (U w ) H <j>v{U' w i),p) 
on S w x S' w , is equal to the set of orbits of the fibre product of iTi(V'\Ei jW ,p) and 
7n(F'\E' w ,p) overir x {V',p). 

In view of this proposition, the only remaining task needed to establish Proposi- 
tion [50] (and thus Lemma |2"3"|) is to show that, after further finite subdivision of W 
and W into subsets, and for generic w,w' in respective subsets of W, W, the set 
of orbits of the fibre product of wi(V'\Ei, w ,p) and 7Ti(F'\Ej w ,,p) over ni(V',p) 
on S w x S' w , = {1, . . . , rn} x {1, . . . , to'} is independent of both w and w' . 

At first glance, it may seem that one would need a rather precise understand- 
ing of the nature of the etale fundamental group TTi(V'\T,i tW ,p), and how it sits 
over TTi(V',p) by the obvious surjective homomorphism, and how it acts on S w . 
Fortunately, however, we only need a small amount of information on this group 
and this action. Namely, let H w be the kernel of the surjective homomorphism 
from TTi(V'\Ei tW ,p) to 7Ti(V"',p). This normal subgroup of m(V'\T,i !W ,p) acts on 
S w = {1, . . . , M}; let be the equivalence relation on {1, ... , M} induced by this 
action (so that two elements of {1, . . . , M} are equivalent by ^ w if there is an ele- 
ment of H w that moves one to the other) . There are only finitely many possibilities 
for this equivalence relation, so by partitioning W further into finitely many subsets 
and passing to one of these subsets, we may assume that ^w—^ is independent of 
w for generic w in this subset. Similarly, letting H' , denote the kernel of the homo- 
morphism from 7Ti(V^'\E' 1 w ,,p) to TTi(V',p), we may assume that the equivalence 
relation ~' on {1, . . . , M'} = S w > induced by H' w , is generically independent of w', 
after passing to one of the finitely many subsets partitioning W'. 

For generic w, the action of Ki(V'\Eii ;W ,p) on {1, . . . , M} now descends to an 
action p w of the quotient group TTi(V',p) on the quotient space {1,...,M}/ ~, 
and similarly for generic w' we have an action p' w , of ni(V',p) on {1, . . . , M'}/ ~'. 
An orbit of the fibre product iri(V'\Ei tW ,p) and 7Ti(y\E' 1 w ,,p) over tti(V ,p) in 



Here we have used the obvious fact that the set of pairs (w, w') for which Si jTO n Sj , has 
dimension more than dim(V) — 2 is a constructible set, so that we may apply Lemma ll3l 
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{1, . . . , M} x {1, . . . , M'} can now be written in the form 

|J u-\ Pw (g)x)x(n')- 1 (pU9)y) 

g£iri(V\p) 

where x is a point in {1, . . . , M} / ~, y is a point in {1, ...,M'}/ ~', and II : 
{1,...,M} ->■ {1,...,M}/ - and n' : {l,...,M'} ->■ {1,...,M'}/ ~' are the 
quotient maps. Such orbits are almost independent of w and w' , save for the need 
to specify the actions p w , p' w , of 7Ti(V,p) on {1, ... , M}/ ~ and {1, . . . , M'}/ ~'. 
But now we use the crucial fact (see Proposition |46| that wi(V',p) is topologically 
finitely generated, so that in order to specify an action such as p w on a finite 
set such as {1, . . . , M}/ ~, it suffices to specify the action of a finite number of 
topological generators. There are only finitely many such possibilities for such 
actions, so after partitioning W further into finitely many subsets and passing to 
one of these subsets, we may assume that p w is in fact independent of w for generic 
w in these subsets; similarly we may assume that p' w , is independent of w' for 
generic w' in one of the finitely many subsets partitioning W' . Now, the orbits in 
{1, . . . , Al} x {1, . . . , M'} are completely independent of w, w' for generic w, w' in 
their respective subsets, giving Proposition 1301 and hence Lemma [521 

6. Extension to higher dimensions 

We can iterate Lemma [23] to obtain a higher dimensional version: 

Theorem 35 (Regularity lemma, higher dimensional version). Let F be a nonstan- 
dard finite field of characteristic zero, let d, k > 1 be a standard natural number, 
let V\, . . . , Vd be definable sets over F, and let E\, . . . ,Ek be definable subsets of 
V\ X . . . X Yd- Then for each 1 < i < d, one can partition Vi into a finite number of 
definable sets V^i, . . . , Vi^ ai for some standard natural number ai, with the following 
property: for any natural numbers j\ , . . . , jd with 1 < ji < ai for each 1 < i < d 
and 1 < I < k, there exists a standard rational number < o~ij li ...j d < 1 with the 
property that 

(21) 125, n (Ax x ... x A d )\ = ^....JAxI . . . \A d \ + O^- 1 ^ . . . \V d \). 

for all nonstandard sets Ax, ... , Ad with Ai C Vij i for all 1 < i < d. 

Furthermore, we may ensure that \Vij\ S> \Vi\ for all 1 < i < d and 1 < j < d£. 

Remark 36. We stress that this lemma is not analogous to the full "hypergraph 
regularity lemma" that generalises the Szemeredi regularity lemma [51], [52], [26], 
[63] . but is instead more analogous to the earlier hypergraph regularity lemma of 
Chung [10] (see also [20]); see the recent paper [45] for some discussion of the 
hierarchy of different hypergraph regularity lemmas. 

Proof. We first remark that the final conclusion \Vij\ ^> \Vi\ can be obtained "for 
free" as follows: by Lemma [26] we have either \Vij\ > \Vi\ or \V id \ < |F| _1 |V^| 
for each i,j. Any Vij that obeys the latter bound instead of the former can be 
absorbed without difficulty into one of the sets Vij in the partition that obeys 
the former bound (and, by the pigeonhole principle, at least one of the Vij will 
obey that bound), without affecting the regularity property ([2~T]) . Thus, to prove 
Theorem I35[ we may ignore the final requirement that \Vij\ 3> \Vi\ for all 

We now induct on d. The case d = 1 is trivial, so suppose d > 2, and the claim 
has already been proven for d — I . 
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Next, we observe that to prove the theorem for a given d, it suffices to do so when 
k = 1, as the higher k case follows by applying the theorem to each E\ separately 
and then intersecting together all the definable subsets Vij of Vi produced by this 
theorem. Thus we may assume k = 1, and abbreviate E\ as E. Applying Lemma 
[23] with V and W set equal to V! x ... x Vd-i and Vd, we may partition 

FiX...xy (i _ 1 = £(U...U E' k , 

and 

V d = V dtl U . . . U V d ,a d 
where k' , ad are standard natural numbers and the E'j, V d j are definable sets with 
the property that for each 1 < f < k' and 1 < jd < ad there exists a standard 
rational number < oy.j d < 1 such that 

(22) \E n {A' xA d )\= a jldd \A'\\A d \ + 0(|F| - 1 / 4 | . . . \V d \) 

for all nonstandard sets A' C E'y and Ad C V d ,j ■ 

Next, we apply the induction hypothesis to the E[, . . . , E' k , to obtain partitions 
Vi = Vi t i U . . . U Vi >ai for i = 1, . . . , d — 1 into definable sets with the property that 
for any j\, . . . ,jd-i with 1 < ji < ai for each 1 < i < d — 1, and each 1 < j' < k', 
there exists a standard rational < af j t < 1 such that 

(23) 

\E' y n (A, X ... X A d _x)| = • ■ • \A d -i\ + OdFr^l^l . . . |Vrf_i|) 

whenever A±, . . . , are nonstandard sets with Ai C V, l j i for all 1 < i < d — 1. 

Now suppose that 1 < ji < for i = 1 , . . . , d, and let A\, . . . , Ad are nonstandard 
sets with Ai C T^.^ for all 1 < i < d. We compute the quantity 

\En(A ± x ... x 

Intersecting Ai X . . . X A^-i with each of the E'j, and using (|22|) . we can write this 
expression as 

k' 

£ ^ Jd |^, n (At x . . . x A d ^)\\A d \ + 0(|F|-V4|^| 

Applying ([2^]) . we can simplify this to 

a 3l _ 3d \A l \...\A d \ + 0{\¥\- l ' i \V 1 \...\V d \) 

where 

k' 

Jd := a fdd <J j',jl,'-;jd-l 
J'=l 

and the claim follows (note that crj lt ... l j i can be adjusted if necessary to not exceed 
1, since |J5 n (Ai x . . . x is trivially bounded by |^i| . . . |^d|)- □ 

For computational purposes, it is convenient to rephrase (|21j) as follows. 

Corollary 37. Let the notation, hypotheses, and conclusion be as in Theorem \35\ 
Then for any nonstandard functions fi : Vi — > *C for i = 1, . . . , d with \fi(x)\ <^ 1 
for all x € Vi, and any 1 < I < k, the expression 

^2 l El (xi,...,x d )fi(x 1 )...fd{xd) 

(x 1 ,...,x d )GV 1 x...xV d 
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is equal to 

£---X>.* II E fi{x i )) + 0{\V\- 1 ' i \V x \...\V d \). 

Proof. By decomposing each /i, we may assume that /i is real, non- negative, and 
bounded by 1, and supported on a single set Vi.j i for some 1 < ji < a;. Our task is 
then to show that 

E l Bi (a;i,...,a; c i)/i(a;i).../ c i(a; ( i) 
(n,..,i d )eyij 1 x...xy dJ[i 

is equal to 

d 

"Lr II £ / i (^))+0(|F|- 1 /4| Vl ,...| Kf | ) . 

By expressing each /; as a (nonstandard) integral /; = f Q lf t >t dt of (nonstandard) 
indicator functions, we may reduce to the case where each fi is an indicator function. 
But the claim then follows from (|2Tj) . □ 

6.1. Expanding definable maps. We now apply the above regularity lemma to 
establish the following dichotomy for definable maps. Given two definable sets V, W 
over a field F, call a function / : V — >■ W definable if its graph {(v, f(v)) \ v g V} 
is a definable set. 

Theorem 38 (Expansion dichotomy) . Let F be a nonstandard finite field of char- 
acteristic zero, and let V, W, U be geometrically irreducible quasiprojective varieties 
defined over F. Let P : V x W — > U be a regular map defined over F. Then at least 
one of the following statements hold: 

(i) (Algebraic constraint) The set 

{(P(v,w),P(v,w'),P(v',w),P{v',w')) | v,v' e V;w,w' € W} 

is not Zariski dense in U A . 

(ii) (Moderate expansion) There exists a partition of U{F) into finitely many 
definable subsets U(F) = UiU. . .UU m with \Ui\ > |Z7(F)| for all 1 < j < m, 
with the property that for any nonstandard sets A C V(F), B C iy(F), 
there exists 1< j <m with 

|C/AP(A,fl)| « |F|-Vie ( | V (P)|/|A|) 1 /2(| W r(F)|/|B|) 1 /2j ? 7 : .|. 

Ln particular, we have the following moderate expansion property: if\A\\B\ 3S> 
|F|- 1 /8|l/(F)||iy(F)|, then \P(A,B)\ » \U(F)\. 

Proof. Assume that conclusion (i) of that theorem fails. We consider the set 

S := {(v,v',w,w',P(v,w),P(v,w'),P(v',w),P(v',w')) | v,v' € V;w,w' eW}d V 2 xW 2 xU 4 . 

This is a graph of a regular map from V 2 x W 2 to U A and is thus (by Proposition [T2)) 
an irreducible constructible set of dimension 2 dim(y) + 2 dim(M / ). By hypothesis, 
the projection of this set to C/ 4 is Zariski dense, and thus the projection map 7r from 
S to U is dominant. Thus, outside of a sub variety A of U of dimension strictly 
less than 4dim(y), the fibres of tt are 2dim(V) + 2dim(VK) — 4 dim(J7)-dimensional. 
Furthermore, 7r _1 (A) has dimension strictly less than 2dim(F) + 2dim(W / ). 
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By Lemma 126] for any F-point x £ t/(F) 4 that does not lie in A, the F-points 
7r ({x})(F) of the fibre at x have cardinality 

(24) |7T- 1 ({a;})(F)| = ( C (x) + 0(|F|- 1 /2))| F |2dim(y)+2dim(H/)-4dim(C/) ; 

where c{x) is the number of top-dimensional geometrically irreducible components 
of the fibre 7r _1 ({a;}) which are defined over F. As this is finite for every x, we see 
from countable saturation (or from degree considerations) that c(x) is uniformly 
bounded in x. Also, from the definition of c(x), it is clear that the level sets E ca := 
{x € J7(F) 4 \A(F) | c(x) — Co} are definable sets for each standard natural number 
Co (and, by the preceding discussion, are empty for sufficiently large Co). Applying 
Theorem [35] (and combining the four partitions of U obtained by that theorem), we 
may thus find a partition U(F) — U\ U . . . U U m into finitely many definable subsets 
with \Ui\ 3> |E/(F)| for all 1 < i < m, such that for any 1 < ji,j2, J3, J4 < m and 
natural number Co, there exists a standard rational number < Vco,h-ih,h,jt — 1 
such that 

\E CQ n {A l xA 2 xA 3 x A 4 )| = a C0>jl!hij3iji \AiU2U 3 \\M +0(\F\- 1 '*\U(F)\ 4 ) 

for all nonstandard subsets A\, A2, A3, A4 of Uj 1 ,Uj 2 ,Uj 3 ,Uj i respectively. From 

Corollary [37] we see that 

(25) 

4 

E /i(«i)/2(«2)/3(«a)/iM = o-coAAAAlI( E /,K))+0(|Fr 1 / 4 |(7(F)| 4 ) 

whenever, for each % — 1,2,3,4, /j : U(F) — > *R is a nonstandard function sup- 
ported on Uj i bounded in magnitude by 1. 

Fix the partition U(F) — U% U . . . U U m , let 1 < j < m be an index, and let 
/ : f/(F) — > K + be a nonstandard function bounded in magnitude by 1 that is 
supported on Uj and has mean zero. We consider the quantity 

(26) l£E 

By the Cauchy-Schwarz inequality, we may bound this expression by 

\a\ 1,2 { E 1 E /( p (^))i 2 ) 1/2 

v£V(F) wes 

which we can rewrite as 

i^ 1/2 i E E /(PM)/(p(«x))r /2 . 

w,w'£B vGV(F) 

By a second application of Cauchy-Schwarz, we can bound this expression by 

| A |l/2| B |l/2( J- I E f(P(v,w))f(P(v, W '))\ 2 )^ 
w,w'£\V(F) v£V(F) 

which we can rearrange as 

| A |1/2 |B| 1/ 2| J- f(P(v,w))f(P(v 7 w'))f(P(v',w))f(P(v',w'))\^ 

(v,v' ,w,w')£V(F) 2 xW(F) 2 

or equivalently 

ses(F) 
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where the tensor power /® 4 : J7(F) 4 — > M + of / is defined by the formula 

f m (u 1 ,u 2 ,u 3 ,u 4 ) := f(ui)f(u 2 )f(u 3 )f(u4,) 

for Ui,U2,U3,U4 € U(F). 

Since 7r _1 (A) has dimension strictly less than 2 dim(V) + 2 dim(W / ), we see from 
Lemma US] that at most 0(|F| 2dim ( y )+ 2dim ( w/ )- 1 ) of the points s e E(F) lie in 
7r _1 (A). Thus, by the boundedness of /, we may bound (1261) by 

| A |1/2|£|1/2| £ / ®4 (7r(s))+0( | F |2dim(y)+2dim W -l ) |l/4 

seS(F)\7r-i(A) 

which we can rewrite as 

|A| i/2 |jB| i/ 2| £ | 7 r- 1 ({(u 1 ,u 2 ,n 3 ,t t 4)})(F)|x 

(ui,u 2 ,«3,"4)eC r (F) 4 \A 
X /M/M/M/M + (|F| 2d ™W+ 2dim W- 1 )| 1 /4. 

Applying (|2^|) (and Lemma 121)1 to bound |J7(F)|), we can bound this by 

|^|l/2|_ B |l/2|| F |2diro(V)+2dim(W0-4di l n(t/) 

(Ul,U 2 ,U 3 ,ll4)6{7(F)*\A 

c( Ul , M2 , U 3,"4)/(«l)/( U 2)/(« 3 )/( U 4) + 0(|F| 2dim ^ +2dim ^- 1 / 2 ))| 1 / 4 . 

We can rewrite this as 

| 7 4|l/2| B |l/2|| p |2dim(V)+2dim(W)-4dim(C/) ^ Cq V" 
(27) c <C (ui,u 2 ,u 3 ,u 4 )eE C0 

/(ui)/(« 3 )/(« 3 )/(t*) + (|F| 2dim ( v )+ 2dim ^)- 1 / 2 ))| 1 / 4 . 

where Co is the largest value of Cq for which £" Co is non-empty (as mentioned 
previously, Co is a standard natural number). Applying (|25|) . we can bound the 
above expression by 

| J 4|V2|5|l/2| n F |2dini(V)+2dim(W)-l/4^|l/4 j 

and thus (by Lemma [ 



| £ J2 f( P (v^))\ « |F | - 1 / 16 1 A| V2 1 ^ (F) | V2 , ^ | V2 |^ (F) 1 1/2 

whenever / : U(F) — > *M is a nonstandard function supported on Uj, bounded in 
magnitude by 1 and of mean zero. 

For each u £ U(F), define the multiplicity function 

H(u) := \{(v, w) G A x B \ P(v, w) = u}|, 

then the above bound can be rewritten as 

| J2 /(«)/*(«)! « W-^A^ViF^B^Wm 1 ' 2 

u£Uj 

whenever 1 < j < a and / : U (F) — ► *R is a nonstandard function supported on 
Uj, bounded in magnitude by 1 and of mean zero. In particular, one has 



(28) Yl \^ ~ H\ « \^\- 1,l6 \A\ l ^\V{F)\^\B\^\W{¥)\^ 
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for all 1 < j < a and jij £ *R is the average value of n(u) on Uj, as can be seen 
by taking / to be the signum function of fJ>(u) — (J,j, normalised to have mean zero 
and bounded in magnitude by 1. 

On the other hand, by double counting we have 

m 

i^iibi = x;m«)=ei^ 

uEU j=l 

By the pigeonhole principle, we can find 1 < j < m such that 

(29) w » p^lAWB]. 
From this and (|2"8)l we see that 

(30) |{« G C/, | = 0}| « |P|- 1 /ie(|^(p)|/| i 4|)V2(| W r(F)|/|S|)V2| Cr .|. 

Since {it G {7j | fi(u) = 0} = Uj\P(A, B), the claim follows. □ 

Remark 39. From (|28|) (and bounding fij crudely by |^4||B|/|£/j|), we conclude 
the additional bound 

|{M) G AxB | P(a,b) G C}\ « J^^M + |F|- 1 /i«|A|Va|K(F)| 1 /3| S |Vaj W ( F )|Va 

whenever A c V(F),B C PF(F),C C ?/(F) are nonstandard sets and conclusion 
(i) of Theorem [35] fails. A variant of the above argument gives the more general 
bound 

\(A x B x C) n S\ « ^J^P + |F|- 1 / 16 | J 4| 1 / 2 |F(F)| 1 / 2 |i3| 1 / 2 |^(F)| 1 / 2 
\U(F)\ 

whenever S is a subvariety of V x W x U with the property that the fibres {u G 
[/ : (u, it), u) G 5} are finite for all v G V, iu G W, and such that the set 

{(•Ui, u 2 , U3, M4) G U* I 3u, u'eV;w,w'gW: (ui, v, w), (u 2 ,v 7 w'), (u 3 , v', w), (u^,v', w') G S} 
is Zariski dense in U 4 , by replacing (|26[) with the more general expression 



lEE E /mi- 

We leave the details of the above generalisation to the interested reader. Such 
bounds, in the context of subsets of C rather than of F, were studied in [15 . It is 
likely that one could use the techniques in this paper to then establish analogues 
of the main results of [18] in the context of large subsets of finite fields of large 
characteristic, but we will not pursue this issue here. 

We also have a variant that gives stronger expansion provided that one can rule 
out a second constraint: 

Theorem 40 (Second expansion dichotomy). Let F be a nonstandard finite field, 
let V, W, U be geometrically irreducible quasiprojective varieties defined over F. Let 
P : V xW — > U be a regular map defined over F. Then at least one of the following 
statements hold: 

(i) (Algebraic constraint) The set 

{(P(v,w),P(v,w'),P(v' ,w),P(v' ,w')) I v,v' G V;w,w' G W} 
is not Zariski dense in U A . 
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(ii) (Second algebraic constraint) There exist geometrically irreducible quasipro- 
jective varieties V',W',U' defined over F with the same dimensions as 
V, W, U respectively, and dominant regular maps f : V' V, g : W — > W , 
h : U' — > U defined over F such that the variety 

{(»', u/,u') G V X W X U' | P(f(v'),g(w')) = h{u')} 

is not irreducible. 

(iii) (Strong expansion) For any non-empty nonstandard sets A C V(F), B C 
W(F), one has the strong expansion property 

\U(F)\P(A,B)\ « \F\- i ' i *(\vmM) 1/a wm/\B\) 1/a \u\. 

Proof. Now we use a variant of the previous argument. We repeat all the construc- 
tion and argument in the proof of Theorem [38l If we have ([29| for all 1 < j < m, 
then we have conclusion (iii) by summing (|30[) in j. Thus we may assume that 

^^odU.nAWB]) 

for some j. Thus, for this j, we have 

1 £ / y £,l Uj {P{v,*)) = o(\A\\B\). 

Write E := {(v,w) G V(F) x W(F) | P(v,w) G [/,■}, then E 1 is a definable subset 
of V(F) x ^(F) and 

(31) \En(AxB)\=o(\A\\B\). 

By Lemmal23l we can partition V(F) into a finite number of definable sets V\, . . . , V a 
and W(F) into a finite number of definable sets W±, . . . ,Wb, such that for any 
1 < i < a and 1 < i' < b, there exists a standard rational number < o~u' < 1 with 
the property that 

(32) \En(A' x B')\ =a u >\A'\\B'\ + 0{\¥\- 1 l 4 \V(F)\\W(¥)\). 

for all nonstandard sets A' C Vi and B' C Wi'; by concatenating the Vi,Wi> if 
necessary we may assume that \Vi\ >■ |V(F)| and \Wj \ > |W(F)|. 

Specialising to A' := A fl and B' := B D Wi> and using (|31l) we see that 

« |F|- 1 / 4 |^(F)||^(F)| +o(|A||B|). 

By the pigeonhole principle we can find i, i' such that 

|AnV5| » \A\ 

and 

|BnW,/| » 
and for this choice of i, i' we thus have 

a iV « |F|- 1 / 4 (|y(F)|/|^|)(|W(F)|/| J B|) + (1). 

We may assume that (|V(F)|/|A|)(|W(F)|/|B|) < | F 1 1 1/s , otherwise conclusion (iii) 
is vacuously true; and so aw <C o(l). Since aw is standard rational, we conclude 
aw — 0. Going back to ([32]) and now setting A 1 :— Vi and B' := Wi', we conclude 
that 

|.En (V5 x W v )\ < |Fr 1/4 |V r (F)||V^(F)|, 
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or in other words that 

E E iv s («)i^Hi C /,(PK^))«|F|- 1 / 4 |y(F)||w(F)|. 

v£V w€W 

Using Theorem |25| we can write Vi = f(V'(F)) for some variety V defined over 
F with the same dimension as V, and some dominant map / from V to V, also 
defined over F;. Similarly W v = g(W'(F)) and Uj = h(U'(F)); thus 

E E 1 /(v-(F))(«)l 9( w(F))Hl/ l ( [ /'(F)( J PK^) « |F|- 1 / 4 |7(F)||W(F)|. 

From Proposition [12] we can find an open dense subvariety V" of V such that / 
is quasi- finite on V" , and thus (by countable saturation, or degree considerations) 
one has a uniform bound < C for all v G f(V"). By intersecting V" 

with its Galois conjugates, we may assume that V" is defined over F. Similarly 
we can find open dense subvarieties W", U" of W' , U' respectively defined over F' 
such that g, h are quasi- finite on W", U" respectively. Then 

E E h(v>>( F ))(v)l g (w>>(F))Mh l (u>> {F )(P(v,w)) « |F|- 1 / 4 |^(F)||W(F)| 
and hence 

\{(v',w',u>) G V"(F)xW"(F)xU"(F) \ h(u') = P(f(v>), g(w'))}\ « |F|" 1/4 |T/(F)||VF(F)|. 

In particular, the variety 

{(v',w',u') G V" x W" x U" | h(u')=P(f(v'),g(w'))} 

has o(|U(F)||M / (F)|) = (|F| dim ( y )+ dim ( H/ )) F-points. On the other hand, this 
variety is quasi-finite over {(V, w', P(f(v'), g(w'))) \ v G V",w G W"} and thus 
has dimension dim(U) + dim(W / ). The only way this is consistent with Lemma [26] 
is if the variety is not irreducible, and the claim follows. □ 

7. Solving the algebraic constraint 

In this section we solve the algebraic constraint that emerges in Theorem 1381 
in the case when V = W = k is just an affine line and P is polynomial. More 
precisely, we show 

Theorem 41. Let k be an algebraically closed field of characteristic zero. Let 
P : k 2 — > k be a polynomial with the property that the set 

(33) {(P(a, c), P(a, d),P(b, c), P(b, d)) \a,b,c,d€ k} 

is not Zariski dense in k . Then one of the following statements hold: 

(i) (Additive structure) There exist polynomials Q,F,G : k — > k such that 
P(x, y) = Q(F(x) + G(y)) for all x, y G k. 

(ii) (Multiplicative structure) There exist polynomials Q, F,G : k — » k such that 
P(x,y)=Q(F(x)G(y))foranx,yek. 

This is not quite what we need for Theorem [50] because the polynomials ob- 
tained here are defined over an algebraically closed field k, rather than over the 
nonstandard finite field F; we address this issue at the end of this section. 

The strategy of proof of Theorem [41] will be to use the Lefschetz principle to 
reduce to the complex case k = C, and then use complex analytic methods to 
analyse certain Riemann surfaces associated with P. 
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We begin with the reduction to the complex case fc = C. This will be a standard 
"Lefschetz principle" argument. We first observe that to prove Theorem |4TJ it 
suffices to verify the case when fc has finite transcendence degree over the rationals 
Q, as can be seen by passing to the field of definition of P. In particular, we may 
identify k with a subfield of C. If the set (|33| is not Zariski dense in fc 4 , then it is 
contained in a proper subvariety of fc 4 , and hence 

{(P(a, c), P(a, d), P(b, c), P{b, d)) \a,b,c,d£ C} 

is contained in the complexification of that variety and is thus also not Zariski 
dense. Applying Theorem |?T] with k — C, we conclude that we may find polyno- 
mials Q ,F ,G ■ C — > C such that P(x,y) — Qo(F (x) + G {y)) or P(x,y) = 
Qo{Fo( x )Go(y)) for all x,y £ C. For sake of discussion let us work with the 
additive case P(x,y) = Qq{Fq(x) + Go(y)), as the multiplicative case is analo- 
gous. We are not quite done yet, because Qo,F ,Gq have coefficients in C rather 
than in k. We claim however that we may find polynomials Q,F,G : k — > k 
of the same degree as Qo, Po,Go respectively and with coefficients in k such that 
P(x, y) — Q(F(x)+G(y)) for all x,y £ k. Indeed, this can be viewed as an algebraic 
constraint satisfaction problem in the coefficients of Q, F, G, where the constraints 
arc defined over k. If this problem has no solutions over the algebraically closed 
field fc, then by Hilbcrt's nullstcllensatz we see that the problem has no solution 
over any extension of fc either, and in particular has no solutions in C, contradicting 
the existence of the factorisation P(x, y) = Qo(Fo(x) + Go(y)). This concludes the 
reduction to the fc = C case. 

Henceforth k = C. Let P : C 2 — > C be a polynomial such that 

{(P(a,c),P(a,d),P(b,c),P(b,d)) \a,b,c,d£ C} 

is not Zariski dense, i.e. it is contained in a proper subvariety of C 4 . We conclude 
that for generic (a, b, c, d) 6 C 4 , the derivative of the map 

(a, 6, c, d) ^ (P(a, c), P(a, d), P(b, c), P(b, d)) 

from C 4 to C 4 is singular. As this is a closed condition, this in fact holds for all 
(a,b,c,d) £ C 4 . Taking determinants, we conclude the constraint 

fPi(a,c) P 2 {a,c) \ 

Pi(a,d) P 2 (a,d) 

P x {b,c) P 2 {b,c) 

V Pr(b,d) P 2 (b,d)J 

for all a,b,c,d £ C, where Pi,P 2 are the partial derivatives of P with respect to 
the first and second variable respectively. Expanding this out, we conclude that 

P 1 (a, c)P 2 (a, d)P 2 (6, c) Pi (6, d) = P 2 (a, c)P 1 (a, d)P x (6, c)P 2 (b, d) 

for all a, b,c,d £ C. 

If one of Pi or P 2 is identically zero, then P(x, y) is a function of just one of the 
two variables x, y, and one trivially has both additive and multiplicative structure, 
so we henceforth assume that Pi, P 2 are not identically zero. We can then rearrange 
the above identity as 

Tr( a ' c )lr(M) = rr(a,d)-±(b,c) 

r 2 r 2 F 2 f2 
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for generic a,b,c,d <E C. Fixing a generic choice of b,d, we conclude in particular 
(after relabeling c as b) that 




for generic a, b G C and some rational functions /, g : C U {00} — > C U {00}, not 
identically zero or identically infinite. 

To motivate the argument that follows, suppose that we could make a change of 
variables o4z, b t-¥ w such that 

dz = f(a)da; dw — g(b)db, 

thus z is a primitive of / applied to a, and w is a primitive of g applied to b. Then, 
by the chain rule, the constraint (|34p simplifies to 

dP dP _ (| 

dz dw 

and so P must be a function of z + w. This already resembles the conclusion of 
Theorem |41] quite closely (particularly if one formally rewrites the multiplicative 
conclusion P(x,y) = Q(F(x)G(y)) as P(x,y) = Q o exp(\ogoF(x) + log oG(y))). 

Of course, the rational functions /, g may contain simple poles, which by the 
residue theorem implies that they do not have single- valued primitives taking values 
in C. However, through monodromy one can still define a primitive taking values 
in some Riemann surface covering the (punctured) complex plane. So, it is natural 
to try to execute the above strategy in the framework of Riemann surfaces rather 
than on the complex plane. This is essentially what we will do next, except that 
we will not explicitly use the abstract language of Riemann surfaces and instead 
work with the more concrete machinery of paths in the complex plane, in order to 
easily take advantage of the additive structure of C. 

We turn to the details. By collecting the residues of the poles of /, we may write 

rn 

/ ( ) = ^_£z_ + /( ) 

* — ' a — a-j 

for some finite number of distinct simple poles ai,...,a m G C, some non-zero 
complex residues ai, . . . , a m G C, and a rational function /(a), such that all poles 
of / have zero residue. In particular, / has a primitive F, which is a rational 
function, so that 

m 

(35) f{a) = J2-^-+F'(a) 

3=1 3 

for all but finitely many a £ C. By translation, we may assume without loss of 
generality that a\, . . . , a m 7^ 0, and that F(0) = 0. 

Now let 7 : [0,1] — > C\{ai, . . . , a m } be a smooth curve avoiding the poles 
ai,...,a m of / that starts at 7(0) = 0. From the fundamental theorem of cal- 
culus, we can evaluate the contour integral J f = J f(a) da as 

// = Ea j LogI^ + F(7(l)) 
h 3= i a J 
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where (by abuse of notation) Log ° J is one of the logarithms log 7 ^ aj of 

7 ^~ aj , with the exact branch of logarithm used depending on the homotopy class 
of 7 in C\{a l7 . . . , a m }. (Here we adopt the convention that J / = oo if 7 termi- 
nates at a (zero-residue) pole of /, and evaluate contour integrals that pass through 
(zero-residue) poles of / by arbitrarily perturbing the contour around such poles.) 
In particular, we see that 

/ / e c 7(1) + r\ 

where Ti < C is the finitely generated subgroup of C generated by 27riai, . . . , 2nia m , 
and for any a G C\{a!, . . . , a m }, c a + i\ denotes the coset 



(36) Ca + r i: =g aj log^i + F(a); 

3=1 aj 

thus c a is only defined up to an additive error in T\. Conversely, for any given 
choice of endpoint a <G C\{ai, . . . , a m }, and any element of the coset z = c a + Ti, 
one can find a smooth curve 7 : [0, 1] — > C\{ai, . . . , a m } from to a with f f = z. 

We need the following technical lemma, reminiscent of the Picard theorem on 
the possible values of entire functions. 

Lemma 42 (Almost surjectivity). For all complex numbers z outside of at most 
one coset ofTi, there exists at least one smooth curve 7 : [0, 1] — > C\{ai, . . . ,a m } 
starting at with J f = z. If T± is not trivial and is not a rank one lattice 
Ti = 2iria7i, then the caveat "outside of at most one coset of T\ " in the previous 
claim may be deleted. 

To see why it is necessary sometimes to exclude a coset of T, consider the case 

/(«) : = —7 —5 

a — 1 a — 2 

so that T is the rank one lattice T = 2mZ and 



/ = Log(l- 7 (l))-Log(l-^). 



'7 

This quantity can attain all complex values except for those on the coset log 2 + V 
(this would require 7 to terminate at infinity, which is not possible since 7 has to 
stay inside C\{ai, . . . , a m }). Another example is when 



so that T = {0} is trivial and 



/ = 



{a -If 
7(1) 



7 



1-7(1) 

In this case, the coset 1 + T = {1} cannot be attained. 



Proof. We first dispose of an easy case when Ti is trivial (i.e. / has no residues at 
any pole). Then J f = ^(7(1)); as / is not identically zero, the rational function 
F is not constant, and the claim follows from the fundamental theorem of algebra. 
(Note that the rational function F — zq has a constant numerator for at most one 
complex number zq.) Thus we may assume henceforth that T\ is non-trivial. 
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Let f2 denote the set of all z such that z = J / for at least one 7 as in the lemma. 
Clearly contains {0}; by the preceding discussion, il is also the union of cosets 
of Ti. Our objective is to show that fl is either equal to C, or C with one coset of 
Ti deleted, with the latter case only permitted when Ti is a rank one lattice. 

If the rational function F is not identically zero, then it is non-constant (since 
-F(O) = 0), and thus has a pole somewhere in C U {00}. In any neighbourhood of 
this pole, the quantity (|36|) attains all sufficiently large finite values in C, thanks to 
Rouche's theorem, thus f2 contains the exterior of a sufficiently large disk. Since i\ 
is a non-trivial (hence unbounded) subgroup of C, and f2 is invariant with respect 
to translations by Ti, we conclude in this case that Q — C as required. Thus we 
may assume without loss of generality that F — 0. 

Next, suppose that the residues ax, . . . , a m do not all lie on a single line Ma. We 
work in the neighbourhood of a single pole a±. From (|36[) and Rouche's theorem, 
we see that fi contains the half-plane {a\z | Re(z) < — C} for some sufficiently 
large C. On the other hand, as the ai, . . . , a m do not all lie in a single line Ra, the 
group Ti is not contained in 27riaiR, and so the orthogonal projection onto aiM 
is unbounded. From this and the Ti-invariance of f2 we conclude that Q = C as 
desired. 

The only remaining case is when all the a±, . . . , a m lie on a single line; by rota- 
tion, we may assume that the a\, . . . , a m are all real. Let us first assume that the 
ai, . . . , a m are not all commensurable (i.e. rational multiples of each other), so that 
Ti is a dense subset of 27riR. As non-constant analytic functions are open maps, 
and / locally has an analytic primitive, we see that fl is open; as C is connected, 
and fi clearly is non-empty, it thus suffices to show that f2 is closed. Accordingly, 
let z n be a sequence in f2 converging to a finite limit z € C; our task is to show 
that z also lies in il. 

By definition, z n = J f for some sequence of smooth curves 7„ : [0, 1] — > 
C\{oi, . . . ,a„}, all starting at but possibly having distinct endpoints 7«(1)- By 
the Bolzano- Weierstrass theorem, we may pass to a subsequence and assume that 
a ' n '■= 7n(l) converges to some limit a'eCU {00}. 

If a' = ai for some pole ai, then from (|36|> and the hypothesis that the a« 
are all real, we see that Re(z„) — > — sgn(ai)oo as n — > 00, which contradicts the 
convergence of z n to a finite limit. If a' € C\{ai, . . . , a n }, then by choosing suitable 
branches of the logarithm, one can make the map ai->c n analytic and non-constant 
(hence continuous and open) in a neighbourhood of a'; as z n € c a ' n + T\ for all n, 
we conclude on taking limits that z € c a > + T; since a i— > c a is open, this implies 
that z £ fl as required. 

Next, we consider the case when a' = 00, thus 7„(1) — > 00. If a% + . . . + a rn is 
non-zero, then from (|36|) we see that Re(z„) — > sgn(ai + . . . + a m )oo, contradicting 
the convergence of z n to a finite limit. Thus ot\ + . . . + a m = 0, and so at least 
two of the ai have opposing signs. Without loss of generality we may assume that 
ai > > ot2- From the real part of / depends only on the endpoint 7(1), 
and goes to — 00 as 7(1) approaches a\ and +00 as 7(1) approaches a2- By the 
intermediate value theorem, we may thus find a smooth curve 7 starting at such 
that Re / / = Rez; as T is a dense subgroup of 27rz]R, we conclude that f2 contains 
a dense subset of the line z + 27rzIR. As Q is open and Ti-invariant, we conclude 
that z £ £1 as desired. This concludes the proof of = C in the case that T is a 
dense subgroup of 27rzIR. 
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Finally, we have to consider the case when Ti is a rank one lattice in 27rzR. If 
«! + ... + a m is non-zero, then we may repeat the previous arguments to show 
that Q — C, so we may assume that ol\ + . . . + ct m = 0. In this case, we see from 
([36f that if 7„ : [0, 1] — > C\{ai, . . . , a m } is a family of paths from the origin with 
a' n := 7„(1) going to infinity and z„ := J 7n /, then 

dist(z„, Coo + Ti) ->• 
as ?i — > oo, where Coo + T\ is the coset 

m 

Coo + Ti := - aj logaj + r x . 

j=i 

From this and the previous arguments, we see that any limit point z of f2 will 
also lie in f2 unless z lies in Coo + Ti. Thus the set fi\(coo + T\) is open, closed, 
and non-empty in C\(coo + Ti); as the latter set is connected, we conclude that 
contains C\(coo +Tx), as required. □ 

All the above analysis involving the poles and residues of / may similarly be 
applied to g. More specifically, we may write 

n a 

(37) 3(&) = Er i V + G 'W 

k=l j 

for all but finitely many b £ C, where b±,...,b n € C are distinct and non-zero, 
/3xj • • • ,0k are non-zero, and G is a rational function with G(0) = 0. We let T2 be 
the subgroup of C generated by 2wi/3i, . . . , 2irif3k, and have the analogue of Lemma 

S3 

Lemma 43 (Almost surjectivity) . For all complex numbers w outside of at most 
one coset of T2, there exists at least one smooth curve 7 : [0, 1] — > C\{6i, . . . , b n } 
starting at with J^g — w. If T% is not trivial and is not a rank one lattice 
T2 = 2iriaZ, then the caveat "outside of at most one coset of T2 " in the previous 
claim may be deleted. 

Now, we are able to solve the differential equation (|3"4"]) . obtaining a rigorous 
version of the heuristic that P is "a function of z + w" in some sense: 

Proposition 44 (Additive structure in z, w). There exists an entire function H : 
C — > C with the property that 

(38) P(7i(l),72(l))=ff(/ /+ / 9) 

for all smooth curves 71 : [0, 1] — > C\{a±, . . . , a m } and 72 : [0, 1] — > C\{6i, . . . , b n } 
with 71 (0) = 72(0) = and /, g f 00. 

Proof. We begin by establishing a preliminary property of P implied by (|38[) . 
namely that 

(39) P( 7 i(l),6)=P(7i(l) > 6) 

whenever b € C and 71, 71 : [0, 1] — > C\{ai, . . . , a m } are smooth curves with 71(0) = 
7i(0) and 4/ = 4/9600. 

Fix 71, 71 as above, and let wq be a complex number to be chosen later. By 
perturbing 71,71 without moving the endpoints, we may assume that 71,71 avoid 
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all the poles of /. By unique continuation, it suffices to establish (|39|) for all 6 in 
some non-empty open subset of C. 

By the connectedness of C\{ai, . . . , a„}, even after deleting the remaining poles 
of /, we can find a smooth homotopy 7^ : [0,1] x [0,1] — > C\{ai, . . . , a n } with 
7^(4,0) = 71 (i) and 7*(t,l) = 71(f) for all t G [0,1], and 7?(0,s) = for all 
s G [0,1], and such that 7J avoids all the poles of /. 

The quantity J 7 *(. s \ f will, in general, not be constant in s; however, it varies 
smoothly in s, and thus lies in a ball B(0, R). From Cauchy's theorem, we see that 

/ /= / /+ / f 

J~ft(-,s+ds) •>!*(•, a) J~f'(l,[s,s+ds]) 

for any 0<s<s + ds< 1, and thus 

(40) / = /(7*(M))-f 7 *(M) 
for all s G [0,1]. 

Suppose that we can find a smooth function 6 : [0, 1] — > C\{6i, . . . , b n } such that 
6(0) = 6(1) = 6, 6 avoids all the poles of g, and 

(41) 9(b(s))±b(s) = -/( 7 *(1, 7*(1, «)• 

as as 

From (1341) and the chain rule, we then have 

S(f(M),^)) = o 

as 

for all s G [0, 1], and thus by the fundamental theorem of calculus 

P(7i(l), 6) = P( 7 *(l, 0), 6(0)) = P(t*(1, 1), 6(1)) = P( 7 (l), 6) 

which is (|3T)|) . Thus, it will suffice to construct such a smooth function 6 for all 6 
in a non-empty open subset of C. 

Suppose that we can find a ball B(wq,2R + 1) and a complex analytic local 
diffcomorphism $ : B(wq, 2R + 1) — > C which avoids all the poles and zeroes of g 
in its image, and is an inverse primitive of g in the sense that 

(42) $'H =g(<f>(w))- 1 

for all w G B(wo,2R + 1). Then for any 6 in the open set $>(B(wq,1)), so that 
6 = $(w) for some w G B(w Q , 1), we can construct the desired function 6 by the 
formula 



6(s) :=$( W + / / - / /)• 

Indeed, from the chain rule and (|4"D]) . (|4"2"|) we obtain (|4ip . and the condition 6(0) = 
6(1) = 6 follows from the hypothesis f j / = J- i /. Thus, to finish the proof of (|3T))) . 
it suffices to obtain a local diffeomorphism $ : B(wq, 2R + 1) — > C with the stated 
properties. 

Suppose that the function G is not identically zero, and thus has a pole at 
some point 6* G C U {00}. If 6* is distinct from 61,..., b m . By (|3"T1) . g then 
has a meromorphic primitive G in a neighbourhood of 6* which has a pole at 
6* but is otherwise holomorphic, and which will be a local diffeomorphism if the 
neighbourhood is small enough. In particular, by Rouche's theorem, we see that 
if wo is a complex number of sufficiently large magnitude, G is a diffeomorphism 
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between some open subset of this neighbourhood and B(wq, 2R +1). Taking $ to 
be the inverse of G, we obtain the claim. 

If the pole &* coincides with one of the bi, then we do not have a meromorphic 
primitive in a neighbourhood of 6* any more, but we still have a primitive on 
sufficiently small semi-neighbourhood such as {b E C | \b — b m \ < e; Re(6 — &*) > 0} 
which is still a local diffeomorphism. By Rouche's theorem (the point being that 
the pole of G dominates the logarithmic factors), one can still find a diffeomorphism 
from some subset of this neighbourhood and B(wo,2R + 1) for a suitably chosen 
Wo, and we can argue as before. 

Now suppose that G is identically zero. As g is not identically zero, we see 
from ([37|) that n is non-zero, thus g has a simple pole at b±. From (|37|) . we see 
that g formally has a primitive in a neighbourhood of b\ that is equal to the 
sum of Pi log(6 — &i) plus a holomorphic function. This is however not rigorous 
because log is multivalued. To get around this difficulty, we cover a small punctured 
neighbourhood B(b\, e)\\b\} of b\ by the half-space {z | Re(z) < logs} via the 
translated exponential map z t— > 6i+exp(z). The differential g{b)db on B(bi, s)\{bi} 
then pulls back to the differential 

g(b\ + exp(z)) exp(z)dz 

on the half-space {z | Re(z) < loge}. For e small enough, this differential has a 
primitive G, which is the sum of f3\z plus a holomorphic function of b\ + exp(z). 
In particular, if u>o is a complex number with /3-j" Re(?«o) sufficiently large and 
negative, we see from Rouche's theorem that G is a diffeomorphism between some 
open subset of this neighbourhood and B(wo,2R +1). Taking $ to be the inverse 
of G, composed with the translated exponential map z H ► b\ + exp(z) (i.e. $(w) := 
bi + exp(G _1 (w))), we obtain the claim. This concludes the proof of (|39]l. 

From (|39|) . we see that for any two smooth curves 71 : 6[0, 1] — > C\{ai, . . . , a m } 
and 72 : [0,1] -»■ C\{h, . . . ,b n } with 7l (0) = 72 (0) and J^fJ^g + 00, the 
quantity P( 7 i(l), 72(1)) is a function of f f and 72(f)- Applying the analogue of 
(|39| with the rules of /, g reversed, we conclude that P( 7 i(l), 72(1)) is a function 
of f and J 2 g. Thus, there is a function Q : fii x fl 2 — >• C such that 

(43) P( 7 i(l),72(l)) = Q(/ /, / 3) 

where fii is the set of all finite values of J j / for smooth 71 : [0, f ] — > C\{ai, . . . , a m }, 
and £! 2 is the set of all finite values of / g for smooth 7 2 : [0, 1] — > C\{6i, . . . , b n }. 
Note from Lemmas [42j (|43|) that fii, O2 are equal to the entire complex plane C 
with at most one coset of ri,r 2 respectively deleted. 

Since / and g have local primitives which are analytic and non-constant, and 
hence open, we see that the function Q is continuous. By further use of these 
local primitives, we see that for all (z ,w ) £ Qi x fi 2 , the function Q(z,Wq) is 
holomorphic for z in a sufficiently punctured disk B(zq, £)\{zq}, and thus (by the 
continuity of Q at zq and Morera's theorem) is holomorphic on the unpunctured disk 
B(zo,e) also. Thus Q(z,w) is holomorphic in the first variable z, and is similarly 
holomorphic in the second variable; thus it is a smooth biholomorphic function on 
fii x fl 2 . 

Let (z ,wq) 6 fii x £! 2 . For z G B(z Q ,e)\{z Q } and w e B(w 0l e)\{wo} for 
e > small enough, we may use local primitives to find smooth 71 : [0, 1] — > 



EXPANDING POLYNOMIALS AND A REGULARITY LEMMA 



47 



C\{ ai ,...,a m } and 72 : [0,1] -> C\{6i, . . . , b n } with 71 (0) = 72 (0) = 0, J^f = z, 
§12 9 = Wl an< ^ /('yi (■'■))' 5(72(1)) 7^ 0. If one then elongates 71 by an infinitesimal 
line segment from 71(1) to 71 (1) + /(7i(l)) -1 di for some infinitesimal dt, and 
simultaneously elongates 72 by an infinitesimal line segment from 72(1) to 72(1) + 
g(72(l))~ 1 ^, we see fronQ 63) that 

^(7i(l) + /(7i(l)) -1 *, 72(1) + 5 (72(l))" 1 d<) = Q(z + dt,w + dt) + o{dt) 
and thus by the chain rule and sending dt to zero 

/( 7 i(l))- 1 Pi(7i(l),72(l)) + .g(72(l))" 1 P2(7i(l),72(l)) = Qi(^,H + Q2(2,H- 

By (j34|) . the left-hand side vanishes, and thus 

Qi(z,w) + Q 2 (z,w) = 

for all z £ B(zq, s)\{zq} and w € B(wq, e)\{wo}; by the smoothness of Q and the 
arbitrariness of zq,wq we conclude that Q\(z,w) + Q 2 (z,w) = for all (z,w) € 
Qi x f?2. In particular, for any complex number (£C, the function 

z 1 y Q(z,(-z), 

which is defined outside of a discrete subset of C, has zero derivative and thus 
extends to a constant function on C. In other words, we can find a function H : 
C — > C such that 

Q{z, w) = H{z + w) 

for all (z, w) € Oi X Since Q is biholomorphic and ^1,^2 are complements of 
discrete sets, we conclude that H is holomorphic on all of C, and is thus entire, and 
the proposition follows. □ 

We now analyse the entire function H given by the above proposition. Observe 
from (|36p that one can freely modify f f by any element of Ti without affecting 
the endpoint 71 (1). As a consequence, we conclude that H must be periodic with 
respect to translations by F±. Similarly it is periodic with respect to translations 
by T 2l and is thus periodic with respect to the combined subgroup T± + T 2 - If 
this subgroup contains two non-zero elements that are not real multiples of each 
other, then H thus descends to a holomorphic function on a torus, and is thus 
constant by Liouville's theorem, which makes P constant, in which case Theorem 
1411 is trivial. Thus we may assume that I\ + T 2 does not contain two non-zero 
elements that are not real multiples of each other, and is thus contained in a line; 
without loss of generality we may normalise (after rescaling the a and b variables) 
so that ri + T 2 < 2iriR. If Ti + T 2 is dense in 2iriM., then H is constant along 2iriR; 
since non-constant analytic functions have isolated zeroes, we conclude that H is 
constant on all of C, so again P is constant and Theorem WW is trivial in this case. 
Thus we may assume that Ti + T 2 is discrete; without loss of generality we may 
normalise so that either Ti + T 2 = {0} or I"i + T 2 = 2niZ. As we shall see shortly, 



Note that while H43H is stated for smooth curves, it extends automatically to continuous 
curves (and in particular to the concatenation of two smooth curves) by a limiting argument. 
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these two cases correspond to the additive and multiplicative case^ of Theorem 

EU 

First suppose that Fi + T 2 = {0}, so that / and g have no poles with non-zero 
residues. In this case, we see from (|36|) that 

j / = F( 7l (l)); f / = G( 72 (1)) 

J 7l ^72 

and hence from Proposition 04] 

P(a,b) = H(F{a) + G(b)) 

whenever a, b G C and F(a), G(b) oo. If F has a pole at some finite a*, then by 
Rouche's theorem F(a) can take any sufficiently large value for a in a neighbourhood 
of a*, and the above equation then forces the entire function H to be bounded in 
a neighbourhood of infinity, and is thus constant by Liouville's theorem, in which 
case Theorem 0T] is trivial. Thus we may assume that the rational function F 
has no finite poles, and is thus a polynomial; similarly we may assume that G is 
a polynomial. By holding b fixed and sending a to infinity, we conclude that H 
is of polynomial growth; being entire, we conclude from the generalised Liouville 
theorem that H is itself a polynomial, and we have obtained the additive conclusion 
of Theorem EJ 

Now suppose that Fi + T 2 = 2iriZ. Then all the ai, j3j are integers, and from 
(|36|) we have 

/ /eF( 7l (l)) + logF( 7 i(l)) 

and 

f .geG( 72 (l))+logG(7 2 (l)) 

for some rational functions F,G, with at least one of F,G non-constant, and all 
71,7a for which F( 7l (1)), G( 72 (l)) ^ oo and F( 7l (1)), G( 72 (l)) ^ 0, oo. From 
Proposition [44j we conclude that 

P(a, b) = H(F(a) + G(b) + log F(a)G(b)) 

whenever F(a),F(b) ^ oo and F(a),G(b) ^ 0,oo; note that the right-hand side 
is well-defined since H is 27r«Z-periodic. We use this periodicity to write H{z) — 
H(exp(z)) for some holomorphic function H : C\{0} — > C, and conclude that 

(44) P(a, b) = H{exp(F{a) + G(b))F(a)G(b)). 

If F has a pole at some finite a*, then for fixed b, exp(F(a) + G(b))F(a)G(b) can 
take any sufficiently large value in a neighbourhood of a*, and so by arguing as 
in the additive case H and hence P is constant, in which case the claim is trivial. 
Similarly if G, F, G have finite poles, so we may assume that F, G, F, G are all 
polynomials. 

18 In principle, the case when Ti + T2 is a rank two lattice would correspond to a case in which 
P is given in terms of an elliptic curve group law instead of addition and multiplication, but such 
laws can only be expressed in terms of rational functions rather than by polynomials and so do 
not actually arise in this analysis. However, if P were generalised to be a regular map on the 
product of two curves, rather than a polynomial map on the product of two affine lines k, then 
one would have to also consider constructions arising from elliptic curve group laws; and in higher 
dimensions one would also have to consider more general abelian varieties. It seems of interest to 
extend Theorem 1411 to these settings, but we will not pursue this matter here. 
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At least one of F, G is non-constant; without loss of generality, assume that F 
is non-constant, and so has a zero at some finite a*. Letting a approach a* while 
fixing b at some generic value, we see that exp(F(a) + G(b))F(a)G(b) can attain any 
sufficiently small non-zero value, and conclude from (|44|) that H remains bounded 
in a neighbourhood of the origin, so that the origin is a removable singularity and 
H can be extended to an entire function. 

Suppose that F is non-constant. Then, fixing b to be some generic value, we 
see from Rouche's theorem that any sufficiently large complex number z can be 
represented as exp(F(a) + G(b))F(a)G(b) for some a = 0(log|z|). We conclude 
that H grows at most like a power of a logarithm (i.e. H(z) = 0(log ^ \z\)) 
and so by the generalised Liouville theorem, H is constant and we are again done. 
Thus we may assume that F is constant, and similarly G is constant. We may then 
absorb the exp(F(a) + G(b)) factor in (l44l) into F or G, and we have obtained the 
multiplicative conclusion of Theorem l41l The proof of Theorem 14 II is now complete. 

Now we move from the algebraically closed field setting to the nonstandard finite 
field setting. 

Theorem 45. Let F be a nonstandard finite field of characteristic zero. Let P : 
F 2 — > F be a polynomial with the property that the set 

(45) {(P(a, c), P(a, d), P(b, c),P(b, d)) \a,b,c,d€ F} 

— 4 

is not Zariski dense in F . Then one of the following statements hold: 

(i) (Additive structure) There exist polynomials Q,F, G : F — > F such that 
P(x, y) = Q(F(x) + G(y)) for all x, y e F. 

(ii) (Multiplicative structure) There exist polynomials Q, F, G : F — > F such 
that P{x,y) = Q(F(x)G(y)) for all x,y G F. 

Proof. Set k := F. If (|45|) is not Zariski dense, then from Lemma[26lwe see that (|33|) 
is not Zariski dense either. Thus by Theorem |4T| we have P(x, y) = Q(F(x) + G(y)) 
or P(x,y) = Q(F(x)G(y)) for some polynomials Q,F, G : k — > k defined over k. 
The remaining difficulty is to replace these polynomials by polynomials that are 
defined over F. 

Note that we may assume that F, G are non-constant, as the claim is trivial 
otherwise. 

Suppose first that we have the additive structure P(x,y) — Q(F(x) + G(yj). 
Clearly P(F,F) C F, and hence |F(F) + G(F)| < |F|. The field of definition of 
F, G is a finite extension F' of F. It will be convenient (particularly when we turn 
to the more difficult multiplicative case) to coordinatise this field F' as follows. Let 
n := [F' : F] be the index of the extension F' over F. Let S : F — > F be a generic 
monic polynomial of degree n, so that S is irreducible and the roots t\, . . . ,t n £ F' 
are distinct. We can then identify F' with the field 

{(R(t 1 ),...,R(t n ))\ReF[t]}ck n 

where R ranges over the polynomials of one variable in t, where we give k n the ring 
structure of componentwise addition and multiplication. This can be viewed as an 
7i-dimensional subspace (over F) of k n , with basis eo, . . . , e„_i given by 

e i := l^lJ • • • J t"n)- 

The restriction F, G : F — > F' of F, G to F can thus be written in components as 
F = J2i=o Fi e i' G = J27=o for some polynomials Fj, Gi : F — > F defined over 
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F. We can then extend these maps to polynomial maps F,G : k — » k n by the same 
formulae: 

n— 1 n— 1 

F:=^F iei ; <5:=^G iei . 

8=0 1=0 

As F, G are non-constant, F, G are non-constant also. 
We then have \F(F) + (5(F) | < |F|. Since 

]T l^(F) n (« - G(F))| = |F(F)||G(F)| » |F| 2 

v€F(F) + G{F) 

and |F(F) n (v - G(F))| < |F|, we conclude that 

|F(F)n(w-G(F))|»|F| 

for 3> |F| values of v. In particular, F(k) n (u — G(fc)) is infinite for infinitely many 
u G k n . As -F, G are non-constant polynomial maps, F(k),G(k) are irreducible 
quasiprojective curves in k n , and so 

F(k) = v-G(k) 

for inhnitely many v. In particular, the symmetry group H :— {v € k n \ F{k) = 

v + F(k)} is infinite. But this symmetry group is an algebraic subgroup of the 
additive group k n of dimension at most 1, and so it is a line. This implies that 
F{k) and G(k) are translates of the same line, and so F(x) + G(y) is an affine (over 
F) function of Fi(x) + Gi(y) for some < i < n — 1. From this it is easy to see that 

P(x, y) - Q(F(x) + G(y)) = Q l {F l (x) + G,(y)) 

for some polynomial Qi : F — > F defined over F, and conclusion (i) of Theorem l45l 
follows. 

Now we turn to the multiplicative case P(x, y) = Q(F(x) ■ G(y)). By arguing as 
before we have 

|F(F) • (5(F) | < |F|. 

Since F, G have only boundedly many zeroes, F(F) and G(F) have all but bound- 
edly many points in (fc x ) n , where k x := fc\{0} is the multiplicative group of k. If 
we define F(F) X := F(F) n (k x ) n and G(F) X := (5(F) n (fc x )", then we have 

| J F(F) X -G(F) X | « |F|. 

Arguing as in the additive case, we conclude that the symmetry group H := {v £ 

(fc x ) n | F(k) X = u • F(k) X } is infinite, where #(fc) X := F{k) n (fc x )™. This 
is a algebraic subgroup of the multiplicative group (fc x )™ that is contained in a 

-= X 

dilate of the connected curve F(k) , and is therefore a connected curve. Us- 
ing the Lefschetz principle to embed the field of definition of H into C, we can 
thus view H as a connected one-dimensional algebraic subgroup of (C x ) n . From 
Lie group theory, a one-dimensional connected subgroup of (C x )™ takes the form 
{(exp(aiz), . . . , exp(a n )2:) | z € C} for some complex numbers a±, . . . , a n . By in- 
specting the limits as z — > oo, one can check that such subgroups are algebraic only 
when ai,...,Q n are commensurate, in that they lie in a dilate Q, in which case H 
can be parameterised as {(t mi , . . . , t mn ) \ t G C x } for some integers mi, ... , m n . 
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As F(k) is contained in a dilate of H, we see (after restricting back from C to k) 
that 

F(k)c{( Cl t m \...,c n t m ") \tek} 

for some constants c\,...,c n G k. Factoring the components of F into monic 
irreducible polynomials, we conclude that 

F(x) = (c 1 F'(x) a \...,c n F'(x) a ») 

for some monic polynomial F' : k —> k defined over k, and some natural numbers 
a%, . . . ,a n (a scalar multiple of the mi, . . . , m n ). 

The Frobenius endomorphism Frobr generates the Galois group Gal(F'/F) = 
Z/nZ, which acts transitively on the roots ti,...,t n of R. This group then also 
acts on the polynomial components CiF'(x) ai of F, by acting on the coefficients 
of these polynomials; by taking degrees, we conclude that ax = ■ . ■ = a n = a, so 
that F' is invariant with respect to the Frobenius action and is thus defined over 
F. Thus F is a scalar multiple of a polynomial (F') a defined over F. A similar 
argument shows that G is also a scalar multiple of a polynomial (G') b defined over 
F, and so P can be written in the form P(x,y) = Q'((F' ) a (x)(G') b (y)) for some 
polynomial Q'\ as P, F', G' are Frobenius- invariant (with F', G' non-constant), Q' is 
also Frobenius- invariant and thus defined over F, giving the required representation 
of P. □ 

Combining Theorem 1451 with Theorem 1381 we immediately obtain Theorem 1201 

8. Weak expansion 

Now we prove Theorem |2"T1 (and hence Theorem [2]), by combining the above 
arguments with the Fourier-analytic arguments from |36j . Suppose for contradiction 
that we can find F, P which obey the hypotheses of this theorem, but do not 
obey any of the four conclusions (i)-(iv). Applying Theorem l20l (and noting that 
moderate asymmetric expansion certainly implies weak expansion) , we see that we 
must have either additive structure in the sense that 

(46) P(xi,x 2 )=Q(F 1 (x 1 ) + F 2 (x 2 )) 

for some polynomials Q, F\, F 2 : F — > F, or multiplicative structure in the sense 
that 

(47) P(x 1 ,x 2 )=Q(F 1 (x 1 )F 2 (x 2 )) 

for some polynomials Q, F\, F 2 : F — > F. 

Suppose first that we have additive structure (|4"6")l . As the conclusion (iii) of 
Theorem [2~T1 is assumed to fail, we can find a nonstandard subset A C F with 

(48) \A\ ^ i/ie 
and 

\P(A,A)\ «< IFI 1 / 2 !^ 1 / 2 . 

We can assume that P is non-constant, so that Q is non-constant. We can also 
assume F%,F 2 non-constant, as the claim is immediate otherwise. In particular, Q 
has fibres of bounded cardinality and so |Q(B)| 3> \B\ for all nonstandard BcF. 
We conclude that 

(49) |i<i(A) + F 2 (A)\ «< IFI 1 / 2 !^ 1 / 2 . 
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If we let B := F 2 (A) x Fi(A) C F 2 and C := (Fi(A) + F 2 (A)) x (Fi(A) + F 2 (A)) C 
F 2 , then we have 

(F 1 {t),F 2 {t)) + BcC 

for all t e A, and hence 

( 5 °) E E is(^y)ic(^ + fiW,2; + i 71 2W)>^||B|»|^| 3 . 

tGF (i,i,)eF 2 

We now use Fourier analysis to expand 

(51) l B (x,y)= ls(Xi)X2)xi(aj)X2(y) 

Xi,X2£F 

where F is the space of nonstandard (additive) characters on F, that is to say the 
nonstandard homomorphisms x '■ F — > *S 1 from F (viewed as an additive group) 
to the unit circle, and ls(XiiX2) are the Fourier coefficients 



lfl(Xi)X2) := E x , yeF l B (x,y)xi(x)x2{y)- 
Similarly, we may write 

(52) l c (x + F 1 (t),y + F 2 (t)) = ]T l c (xu X2)xi(x) X 2(y)xi(F 1 (t)) X 2(F 2 (*)). 

Xi,X26F 

Multiplying f|52|) by the complex conjugate of (|51|) and summing using the orthog- 
onality properties of characters, we may expand the left-hand side of (|50[) as 

|F| 2 is(xi^2)ic(xi,X2)E^( F i( t ))^ 2 ( F2 W)- 

Xi,X2£F «6F 

Using the trivial bound |xi|, \x%\ — 1> we see that the contribution of any given 
pair (xi, X2) to the above sum is 

|F| 2 (|5|/|F| 2 )(|C|/|F| 2 )|F| 

which by (g9) and the trivial bound \B\ < \A\ 2 is o(|A| 3 ). We thus have 

E |is(xi,X2)l|ic(xi,X2)llExi(^W)x2(F 2 W)| » |A| 3 /|F| 2 

(X!,X2)eFxF\_E teF 

for any subset E C F x F of bounded cardinality. On the other hand, from the 
Plancherel identity we have 

E Mxi,X2)| 2 HB|/|F| 2 <|A| 2 /|F| 2 

(xi,X2)eFxF 

and 

E |ic(xi,X2)| 2 = |C|/|F| 2 <«|A|/|F| 

(xi,X2)£FxF 

and hence by Cauchy-Schwarz 

E IMxi,X2)||Mxi,X2)| «< |A| 3 / 2 /|F| 3 / 2 . 

(xi,X2)GFxF\_E 

By Holder's inequality, we conclude the existence of xi,X2 € F with (%i,X2) ^ E 
such that 

lE^( F iW)X2(^(t))|»|^| 3/2 /|F| 1 / 2 ; 
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in particular, from the hypothesis (j48|) we certainly have 



(53) \J2xi(Fi(t)) X2 (F 2 (t))\ »>|F| 1 /2. 



(with some room to spare) . As E was an arbitrary set of bounded cardinality, we 
conclude that ((55)) holds for an unbounded number of pairs (xi,X2) € F x F. In 
particular, it holds for a pair (XI7X2) with {xiiXv) 7^ (0,0). 

As is well known, the group F of additive characters of F is isomorphic to F 
itself, and there exists a non-trivial generator xo G F such that any other character 
X € F takes the form x( x ) = Xo{ ax ) f° r some a £ F. (Indeed, if F is a standard 
finite field of characteristic p, one can take xo to be Xo( x ) '■= e 27T ^( x ^ p where <p 
is any linear surjection from F (viewed as a vector space over F p ) to F p , and the 
nonstandard case then follows by Los's theorem.) We may thus write 



for some a, b €E F, not both zero. On the other hand, from the Weil bound on 
character sums (see e.g. [46]) and Los's theorem we have 



whenever xo is a non-trivial additive character and P is a non-constant (external) 
polynomial. We conclude that aFi(t) + 6i<2(i) must be constant, and it is then an 
easy matter to write P in the desired form for Theorem [2lTi) . 

Now we suppose that we are in the case when P has multiplicative structure 
(|47l) . Arguing as in the additive case, we then can find a nonstandard subset 
icF obeying (gSJ) and 



Again, we may assume that Q,F±,F2 are non-constant. By removing a bounded 
number of elements from A, we may also assume that ^ F\(A), F2(A), thus 
Fi(A), F2(A) take values inside the multiplicative group F x := F\{0}. 

We now run the same Fourier analytic argument used in the additive case, but 
with the underlying abelian group used for Fourier analysis being the multiplicative 
group F x rather than the additive group F. Let F x be the space of nonstandard 
(multiplicative) characters on F, that is to say the nonstandard homomorphisms 
ip : F x — > *S' 1 from F x (viewed as a multiplicative group) to the unit circle. By 
repeating the previous arguments with the obvious changes, we conclude that 



Xi (Fi (t)) X 2 (F 2 (t)) = J2 Xo (aFi (t) + bF 2 (<)) 




|^ X o( J PW)|«|F| 1 / 2 



\F 1 {A)-F 2 (A)\<m:\F\V 2 \A\ 1 / 2 . 



(54) 




for an unbounded number of pairs (ipi,ip2) € F x x F x . 
We factor F\ , F 2 as 



Fx = dP" 1 . . . P r °" , F 2 = C2PI 1 --.Pr 



where c\,C2 G F x , P\, . . . , P r are a bounded number of distinct monic irreducible 
polynomials, and a%, . . . , a r , 61, . . . , b r are standard natural numbers. If (oj, . . . , a r ) 
and {pi,..., b r ) are linearly dependent, then P can be expressed in the desired form 
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for Theorem l2TT ii). so suppose that these vectors are linearly independent. We can 
rewrite the left-hand side of (|54j) as 

r 

teF i—l 

By the Weil bound for multiplicative characters (see e.g. [67] Corollary 2.3]), and 
the fact that P\, . . . ,P r are distinct monic irreducible polynomials, this expression 
is OdFl 1 / 2 ) unless ipT^ is trivial for all i = l,...,r. But as (a\, . . . ,a r ) and 
(pi, . . . , b r ) are linearly independent, this can only occur if ipi,ip2 both have order 
at most C, for some standard C. On the other hand, the multiplicative group F* 
of a finite field of order q is isomorphic as a group to the cyclic group Z/(q — 1)Z, 

and so the dual group Fq also is isomorphic to this group. In particular, for any 
k, there are at most k characters in Fq of order k. Applying Los's theorem, we 
conclude the same statement holds for F. Thus there are only a bounded number 
of pairs (ipi,ip2) f° r which ([54]) holds, giving the desired contradiction. 

9. The second algebraic constraint 

We are now ready to establish Theorem [22] (and thus Theorem [3]). Suppose for 
contradiction that we can find F, P which obey the hypotheses of this theorem, but 
do not obey any of the four conclusions (i)-(iv). Applying Theoreml4"01 and Theorem 
|4"51 we see that the only possibility is that there exist geometrically irreducible 
one-dimensional quasiprojective varieties V', W , U' defined over F and dominant 
regular maps / : V — > F, g : W F, h : U' — > F defined over F such that the 
variety 

(55) {(«', w', v!) € V X W X U' I P(f(v'), g(w')) = h{u')} 

is not irreducible. As F has characteristic zero, the curves U', V, W are generically 
smooth, so by removing a finite number of points from each curve we may assume 
that U',V , W are smooth curves. 

Note that if P only depends on the first variable (thus P(x,y) — Q(x) for some 
polynomial x) then we have either conclusion (i) or (ii) of Theorem [22] so we may 
assume that P does not depend purely on the first variable. Similarly, we may 
assume that P does not depend purely on the second variable. In particular, P is 
non-constant and thus dominant. 

We now claim that the varieties 

(56) {(?/, w') eV'xW'\ P(f(v%g(w')) = t} 

are reducible for generic ( e F. Indeed, suppose this were not the case; then (as 
the set of c for which (|5"6"]) is reducible is constructible) this implies that (|55| is 
irreducible for generic t € F. Now consider an irreducible component of the variety 
(l55l) . This can be viewed as the relative product of the varieties 

(57) {(«', w', t) e V x W x F | P(f(v'), g(w')) = t) 
and 

(58) {(u,t) e U X F | h(u') =t} 

over F. As discussed previously, the generic fibres of the first factor (|57|) of this 
relative product are irreducible. Hence, any irreducible component of this relative 
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product must be generically equal to a relative product of (|57p with an irreducible 
component of (|58| . But (|58|) is isomorphic to L7 and is thus already irreducible, so 
that (|55p is irreducible, a contradiction. Thus (|56l) is generically reducible. 

We may rephrase the previous conclusion in the language of linear systems as 
the assertion that the linear system ((V, it/) M- P(f (v'), g(w')) — t) t£ p on V x VF' is 
reducible. We also observe that this system has no fixed components, as the varieties 
()56[) are disjoint as c varies. We may then apply (the generalisation of) Bertini's 
second theorem (see e.g. [UJ Theorem (5.3)]) to conclude that this system is 
composite with a pencil, which means that there is a regular map Q : (V x W'\E) — > 
C from a dense subvariety (V x W')\S of V x W' into an irreducible algebraic 
curve U' and a regular map h : U' — >• F of degree d > 2 (in the sense that the fibres 
k~ x {{t}) generically have cardinality d) such that 

(59) P(f(v'),g(w')) = h(Q(v',w')) 

for generic («', to') £ V' X W'. (Indeed, one can simply take C to be the curve given 
by the Chow coordinates of the irreducible components of generic fibres (|56p , with 
the obvious generically defined maps Q,h.) Note that as P,f,g are dominant, the 
maps h, Q must also be dominant. 

We now begin transferring the base field F to the complex field C in order to 
use the theory of Riemann surfaces. As in Section [3 we may find an algebraically 
closed subfield k of F of finite transcendence degree over Q such that all the varieties 
U',V, W, £ and regular maps P,f,g,h,Q used above are defined over k, and then 
embed k into the complex field C, so that all these varieties can also be viewed as 
complex varieties (with the regular maps being complex analytic maps). 

Henceforth we will work over the complex field. We will exploit the well-known 
fact (see e.g. [35l Corollary 6.10]) that any smooth quasiprojective algebraic curve 
over an algebraically closed field k is isomorphic to an open dense subset of a 
smooth projective algebraic curve over k, or that is to say a projective algebraic 
curve with a finite number of points deleted. Furthermore (see e.g. [351 Proposition 
6.8]), any regular map from an open dense subset of a projective algebraic curve C 
to a projective variety V can be uniquely completed to a regular map from C to 
W. As such, we can view the algebraic curves U',V',W' in the above discussion 
as open dense subsets of smooth projective curves U, V, W respectively, and we 
can also view C as an open dense subset of the projective line P 1 (C). We can 
thus complete the regular maps / : V — > C, g : W — > C, h : U' — > C to regular 
maps / : V -> P\ g : W -> P 1 , h : U -> P X (C). Unfortunately, the regular 
maps P, Q, being defined on the product of curves rather than on a single curve, do 
not automatically extend to the projective completion. However, we may obtain a 
regular extension Q : (V x — > U of Q defined outside of a finite subset S of 

V x W a,s follows. Take the graph 

{(v, w, Q(v, w)) : v eV,w £W, Q(v, w) well defined} C V x W x U 

which is an irreducible constructive set by Proposition [12] Its closure S is then an 
irreducible projective varietj0 of dimension two. Projecting S back down to V x W, 
we see that outside of a subset A of S of dimension at most one, this projection has 
zero-dimensional (hence finite) fibres, with the fibres being at least one-dimensional 



Here we use the fact that the product of projective varieties is (up to isomorphism) again 
projective, see e.g. |49l Lemma 1.6.3]. 
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on A. In particular, the projection S of A to V x W is finite. Outside of E, the fibres 
are finite and generically a single point; a local connectedness argument (using the 
fact that every point in V x W contains arbitrarily small neighbourhoods which 
are connected even when one removes those points in which Q is undefined) then 
shows that the fibres are a single point everywhere in V x VF\E, and so 5 is a 
graph of some function Q : V x W\S —> C. This implies that the projection from 
S\E to V x W\A has degree one (because the generic fibre has cardinality equal 
to the degree in characteristic zero, see e.g. [33] Proposition 7.16]), thus k(S\T,) is 
isomorphic to k(V x W\A). As the variety V x W\A is smooth, it is normal (see 
e.g. [SSI Theorem II. 5.1]) and so fc[S\E] is isomorphic to k[V x W\A]. Thus the 
map (v,w) i-> (v,w,Q(v,w)) is regular on V x TU\E, and so Q is regular on this 
domain as well. 

Similarly, we have a regular extension P : (P : (C) x P 1 (C))\A -> P^fc) defined 
outside of a finite subset A of P X (C) x P^C). By enlarging E if necessary, we may 
assume that (f(v),g(w)) £ A whenever (v,w) £ E. Using these regular extensions 
and (j5"9"]) . we see that 

(60) P(f(v),g(w)) = h(Q(v,w)) 

for all (v,w) € (V x W)\E. 

The smooth projective curves V, W, U , when viewed over C, become compact 
Riemann surfaces, and thus each have a well-defined genus, which is a natural 
number; see e.g. [27]. We now split into several cases depending on the genera 
9vi9w'9u °f tne curves V,W,U (viewed as curves over C). The high genus cases 
will be relatively easy to eliminate, by using existing theorems in the literature that 
limit the number of regular maps available between high genus Riemann surfaces, 
and once we reduce to the case when U has genus zero, we will be able to conclude 
the final option (iii) of Theorem 1221 after some normalisation. Unfortunately, we 
were not able to argue to similarly reduce the genus of V or W to zero, which is 
why Theorem l2"27 iii) still makes reference to curves of arbitrary genus. 

We turn to the details. First suppose that gfj > 2 and g v < 2. In this case, 
we use the Riemann- Hurwitz formula (see |271 p. 219]), which among other things 
implies that there does not exist a non-constant regular map from a Riemann 
surface of genus g to a Riemann surface of genus g' whenever g < g' . We conclude 
that the map (v,w) H> Q(v,w) is constant in v for all w (outside of E, of course), 
which implies by (|59"|) and the dominance of f,g that P(v,w) is a function of w 
only, contradicting our previous hypothesis about P. 

Now suppose that gf, > 2 and g v > 2. For this case, we use a classical theorem 
of de Franchis [15] . that asserts that when two Riemann surfaces U, V have genus 
at least two, then there are only finitely many non-constant regular maps hi, . . . , h n 
from V to U. For each i = 1, . . . , n, the set of w € W such that Q(v, w) — hi{v) 
for all v £ V for which Q(v, w) is well-defined is a closed subset of W, as is the set 
of w € W for which v h-> Q(v,w) is constant. As these sets cover W, we conclude 
that Q is either constant in v or constant in w (outside of E), which implies that 
P is constant in either v or w, leading to a contradiction as before. 

Now suppose that gfy = 1 and g v > 2. Here we can use a variant of the de 
Franchis theorem, namely the theorem of Tamme [62j that for any fixed degree 
d, there are only finitely many non-constant regular maps from V to U of degree 
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at most d. The maps v i— > Q(v,w) can easily be seen to have uniformly bounded 
degree if they are non-constant, and so by repeating the previous argument we 
again obtain a contradiction. 

If gjy — 1 and g v = 0, then we can again use the Riemann-Hurwitz formula to 
show that there are no non-constant regular maps from V to U, and we can argue 
as before to reach a contradiction. 

If gjy = 1, then the previous arguments (together with their counterparts when 
V and W are switched) handle all cases except when g v = g^ = 1; thus U, V, W 
can all be viewed as elliptic curves (after arbitrarily designating one point on each 
of U, V, W as the origin). It is a classical fact (see e.g. [27J P- 238]) that every 
elliptic curve over C has the structure of a complex abelian group, and specifically 
to a torus C/T for some discrete lattice T of C. We can thus form the identifications 
U = C/Ty, V = C/T v , W = C/Tyy on the level of Riemann surfaces (and complex 
abelian groups) for some lattices r^,ry,r^. It is then known (see [35l Lemma 
4.9]) that any regular map from V to U corresponds to a map from C/T v to C/Tjy 
of the form z i— > zq + mz, where zq € C and m is a complex number such that 
mTy C r^. In particular, m is constrained to a discrete subgroup of C (the rank 
of which depends on whether U, V are isogenous, and whether they have complex 
multiplication). For each such m, the set of w E W for which (the completion of) 
the map v i— > Q(v,w) corresponds to a map of the form z t— > zq + mz for some 
complex number zq is a Zariski closed subset of W (this follows from the fact that 
the group operations on an elliptic curve are given by a regular map), and is thus 
either finite or all of W. As W is uncountable (when viewed over the complex 
numbers), we conclude that there is a single m for which the above statement holds 
for all w € W. In particular, this implies that Q(v,w) takes the form 

(61) Q{v,w) =R(v)®fjS(w) 

on the domain of definition of Q, where R : V — > U corresponds to the map z i— > mz, 
and S : W —¥ U is a map (which is necessarily regular, since Q and R are regular). 
Again, we may assume that R, S are non-constant (hence dominant), as otherwise 
P depends only on one of v, w, contradicting our preceding hypothesis. 

At this point we could use Theorem [45] to conclude, but we instead give the 
following more direct argument. We start using the hypothesis that P : P 1 (C) x 
P X (C) — > P X (C) is not just a regular map, but is in fact (the completion of) a 
polynomial, which implies that P(x,y) — oo can only occur if x — oo or y = oo. 
Combining this with (pD| and ([6"Tj). we conclude that R(v) S(w) € /i _1 ({oo}) 
can only occur if v € / _1 ({oo}) orwe g~ 1 ({oo}). As h, f,g,R,S are dominant, 
we thus conclude that there are finite subsets A, B of U such that the only pairs 
(a, b) e UxU with a(B(jb G /i _1 ({oo}) are those with a G A or b e B. But as U is an 
infinite group, this is only possible of fe~ 1 ({oo}) is empty. But h : U — > P X (C) is a 
projective morphism, hence has Zariski closed image (see e.g. [56j Theorem 1.5.2]); 
since h is non-constant, it is therefore surjective, a contradiction. This concludes 
the treatment of the g^ — 1 case. 

The only remaining cases are when g^y = 0, thus U is isomorphic to P 1 (C) (see 
[35l Example 1.3.5]), and so without loss of generality we can take U = P X (C). In 
particular, Q : (V x W)\E P^C) and h : P J (C) P X (C) are now meromorphic 
functions on Riemann surfaces. 
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As h is a non-constant meromorphic function, h~ ({oo}) must contain at least 
one point. Suppose first that /i _1 ({oo}) contains at least two points, which after 
a Mobius transformation we may normalise to be and oo. From (|60j) we see 
that if (v,w) € (V x is such that Q(v,w) — or Q(v,w) — oo, then either 

g(w) = oo or f(v) — oo. Thus, for all but finitely many w, the meromorphic 
function that is (the completion of) v H> Q(v, w) has all of its zeroes and poles in 
/ _1 ({oo}). The order of these zeroes and poles is easily seen to be bounded, so 
there are only finitely many possibilities for the divisor of v i— >• Q{v, w) (the formal 
sum of the zeroes, minus the poles). The set of w for which a given divisor occurs 
is a constructible subset of W, so there must exist one divisor D which is attained 
for all but finitely many w. By Liouville's theorem, any two meromorphic functions 
with the same divisor must differ by a multiplicative constant, so we conclude that 
Q(v, w) — R(v)S(w) for all but finitely many v € V, all but finitely many w £ W , 
and some meromorphic functions R : V -> P X (C), S : W -> P X (C). 

As before, we could use Theorem 05] to conclude at this point, but we will 
again give a more direct argument. Now let v,v' € V be such that f(v) = f{v'). 
Excluding finitely many exceptional pairs (v, v') (including those for which one of 
R(v), R(v') is zero or infinite), we conclude from the above discussion and (|60l) that 

h{R{v)S(w)) = h(R(v')S(w)) 

for all but finitely many w, as S is dominant, this implies that 

h((R(v)/R(v'))z) = h(z) 

for all but finitely many z £ C, and hence for all z E C. Consider the group 
G of complex numbers u such that h(uz) = h(z) for all z € C. As h is non- 
constant, this is a finite subgroup of C x and is thus the iV th roots of unity for 
some natural number N. Then we may write h(z) = h'(z N ) for some regular map 
h! : P X (C) — > P 1 (C), so by replacing h with hi and Q with Q N if necessary we 
may assume that N = 1. (Note that this cannot reduce the degree of h to less 
than 2, since h will still map both and oo to oo.) We conclude that for all but 
finitely many pairs (v,v'), f(v) = f(v') implies R(v) = R(v'). Thus the irreducible 
projective variety {(f(v),R(vj) | v e P 1 (C)} is a graph outside of a finite set of a 
function from P 1 (C) — > P 1 (C); this function is continuous, generically holomorphic 
and blows up at most polynomially at any point, and is thus rational, so that 
R = Y of for some rational function Y : P 1 (C) — > P 1 (C). Similarly we may assume 
that S = Z o g for some rational Z : P X (C) -> P^C). But then from ([BDJ and the 
dominance of /, g we conclude that 

P(x,y) = h(Y(x)Z(y)) 

for all but finitely many pairs (x,y) G C 2 . Since ft, _1 ({oo}) contains 0, oo, we see 
that Y, Z cannot have any poles or zeroes on C and are thus constant, contradicting 
the non-constant nature of P. 

The only remaining case is when h~ 1 ({oo}) consists of a single point, which we 
may normalise to be oo. Then the meromorphic function h maps C to C and is 
thus a polynomial. We have now almost reached the conclusion in Theorem l22f iii). 
with the main thing missing being that the curves V, W are not yet affine, and 
the maps f,g,Q are not yet polynomials. To address this, we use a variant of the 
argument that treated the case when ft, _1 ({oo} contained more than one point. 
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Namely, we observe from (|60|) that for all but finitely many w G W, the function 
i; i y Q(v, w) is defined on all of V, and is finite outside of / _1 ({oo}). This function 
is a meromorphic function on V of bounded degree. Applying the Riemann-RocfP^ 
theorem (see [UJ P- 245]), we conclude that there is a finite-dimensional vector 
space S (over C) of meromorphic functions that only have poles at / _1 ({cx)}), such 
that the functions v h-> Q(v,w) lie in S for all but finitely many w £ W. Letting 
ei,...,e„ : V"\/~ 1 ({oo}) — » C be a basis for this vector space, we thus have a 
representation of the form 

n 

(62) Q(v,w)=J2 z iMei(v) 

i=l 

for all but finitely many w, and all v £ V^\/ _1 ({cx)}). As Q is a regular map and 
the ei are linearly independent, we conclude that Zi are regular maps from W (with 
finitely many points deleted) to C and are thus meromorphic. 

From (|60p . we see that for all but finitely many v, we have w <-> Q(v,w) defined 
on all of W, and finite outside of g~ 1 ({ oo}) . From this , (|62l) , and the linear indepen- 
dence of the ei we see that the Zi are finite outside of Q{v, w). Also one can easily 
verify that the Zi have bounded degree. Applying the Riemann-Roch theorem, we 
can find a linearly independent set of regular maps e' x , . . . , e' m : VF\.g -1 ({oo}) — > C, 
such that each Zi is a linear combination of the , thus we have 

n m 

(63) Q(v,w) =^2^2c ij ei(v)e' j (w) 

for all v outside of a finite subset of V, and all w outside of a finite subset of W, 
and some complex coefficients cy . 

Now let V" C C 1+n , PF C C 1+m be the sets 

V := {(/(«), ei («), . . . , e„(«)) | » G V\r ^{oo})} 

and 

PF := eiH, • ■ • , | «; G ^^({oo})}. 

These are the irreducible projective curves {(/(«), ei(u), . . . , e n (v)) \ v £ V}, 
{(g(w), e'i(w), e' m (w)) \ w £ W} with a finite number of points deleted, and 
so are irreducible quasiprojective curves. If we define the polynomial maps /' : 
C 1+n -*C,g': C 1+m -> C, Q' : C 1+ ™ x C 1+m ^ C by 

f'(z ,zi,...,z n ) :=z 
g'(w ,wi, . . .,w m ) := w 

n m 

Q'((z , z n ), (w , w m )) -=^2^2 

CijZiWj 

i=l 3 = 1 

then /', g' are non-constant on V, W, and we see from (j59)) . (|63[) that 

P(f(v),g'(w))=j(Q'(v,w)) 



"One does not need the full power of the Riemann-Roch theorem here; the finite dimensionality 
of this space can also be established from Laurent expansion at each pole together with Liouvillc's 
theorem. 
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for all v outside of a finite subset of V, and all w outside of a finite subset of W . 
As all functions involved here are polynomials, we may pass to the affine Zariski 
closures V C C 1+n , W C C 1+m , which are affine irreducible curves, and conclude 
that the above identity in fact holds for all v £ V and w £ W . This gives Theorem 
[227 iii) except for the fact that all polynomials and curves here are defined over C 
rather than k; but as P was already defined over k, we may use the nullstellensatz 
(or the Lefschetz principle) as in the proof of Theorem [41] to locate a choice of 
V, W, /', g' , Q' obeying the above properties and also defined over the algebraically 
closed subfield k of C. This (finally!) concludes the proof of Theorem l22l 

Appendix A. The etale fundamental group 

Throughout this appendix, k is an algebraically closed field of characteristic 
zero. (Some portion of the discussion below can be generalised to the positive 
characteristic setting, but several additional technical complications arise in that 
case which we do not wish to discuss here.) 

Given any smooth irreducible variety W over k, and a point p in W, one can 
define an Stale fundamental group w± (W, p) of W at p. The precise construction of 
this group is not particularly relevant for this discussion, but one can for instance 
define tt\(W,p) as the grou p o f automorphisms of the fibre functor <fr H> 4>~ 1 {{p}) 
that maps finite etale covero <j> : V — > W of W to their fibre 4'~ 1 ({p}) (and is thus 
a functor from the category of finite etale covers of W to the category of sets). See 
[2T)l Chapter V] or [501 Chapter 5] for details of this construction. (One can also 
define the etale fundamental group for more general connected, locally noetherian 
schemes, but we will not need this additional level of generality here.) 

In this appendix, we will list some key properties of the etale fundamental group 
that are well established in the literature (such as [2jj||), which we will use as "black 
boxes" . Under the running hypothesis that k is algebraically closed and has char- 
acteristic zero, these properties are largely analogous to properties of the (profi- 
nite completion of the) topological fundamental group of a complex variety. (The 
situation is more complicated in positive characteristic, due to the existence of 
Artin-Schreier coverings, but we will not need to deal with these difficulties here.) 

An important property of the etale fundamental group for us will be topological 
finite generation: 

Proposition 46 (Topological finite generation). Let W be a smooth variety over 
an algebraically closed field k of characteristic zero, and let p be a point in W . Then 
etale fundamental group iri(W,p) is a profinite group which is topologically finitely 
generated, that is to say there is a finitely generated subgroup which is dense in the 
profinite topology. 

Proof. The profiniteness of the etale fundamental group follows from construction 
(see §V.7] or [5UJ Theorem 5.4.2]). Topological finite generation is established 
in [31] Theorem II. 2. 3.1] (and can also be deduced from Theorem [48] below) . □ 



A minor technical point here: in the definition of the etale fundamental group of a general 
locally connected Noetherian scheme, one needs to consider covers that are also as general as 
a locally connected Noetherian scheme. However, it is known that any finite etale cover of a 
quasiprojective variety is again a quasiprojective variety |28l Proposition 5.3.2], and so one can 
work entirely within the category of quasiprojective varieties here. We thank Antoine Chambert- 
Loir for pointing out this subtlety. 
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Given any finite etale covering <ft '■ V — > W of W by a smooth variety V, the fibre 
_1 ({p}) is automatically a finite set, and by construction, the fundamental group 
TTi(W,p) acts on this set, thus each group element g G ni(W,p) will map a point v 
in (f)~ 1 ({p}) to a point gv in 4>~ 1 ({p\). This action is natural in the following sense: 
if one has two finite etale coverings <fi : V — > W and cf)' : V' W and a regular 
map / : V —> V with <fi = <f>' o /, then the actions of iri(W, p) on and 
are intertwined by / in the sense that 

gf(v) = f(gv) 

for all g G -Ki{W,p) and w G <^ _1 (W); see [Ml §V.7] or [60, Theorem 5.4.2]. A 
particular consequence of importance to us occurs when one has two finite etale 
coverings <f>i : Vx — > W, <f> 2 ' V 2 —> W by smooth varieties Vi,V 2 . Then we may 
form the the fibre product (f>i Xw 4>2 -V\ Xw — > W, where 

Vi x w V 2 := {{vi,v 2 ) G Vi x V 2 I <Mfi) = foM} 

and 

01 (f>2(vi,V 2 ) = (pi(vi) = <j) 2 {v 2 ) 
then 4>\ X\y 4>2 is easily seen to also be a finite etale covering, and the action of 
TTi(W,p) on the product fibre (4>\ x w 02) _1 ({p}) = ^^{{p}) x 4' 2 1 ({p}) is the 
direct sum of the action of ni(W,p) on the individual fibres 4>i 1 ({p}), 4> 2 l {{p\), 
thus 

g(vi,v 2 ) = (gv 1 ,gv 2 ) 

for all g G TTi(W,p), Vi G ^({p}), and t; 2 G ^{{p})- 

Givne a finite etale covering <p : V — > W by a smooth variety T^, we can use the 
etale fundamental group n 1 (W,p) to relate the connected components of V with 
the fibre (f>~ 1 ({p\) (and, for the purposes of this paper, this is the main reason 
why need the etale fundamental group in the first place). Indeed, V is connected 
if and only if the action of tti{W,p) on _1 ({??}) is transitive (see [Ml §V.7] or [501 
Theorem 5.4.2]); from this and functoriality, we see that in the more general case 
when V is not necessarily connected, the orbits of ni(W,p) on <^ _1 ({p}) are nothing 
more than the fibres over p of the connected components of V. 

Let W be a smooth irreducible variety, let U be a smooth irreducible subvariety 
of W, and let p be a point in U. Then there is a canonical homomorphism r\ from 
iTi(U,p) to ni(W,p), which is compatible with the actions of these groups on fibres 
in the following sense: if : V — > W is a finite etale covering of smooth varieties, 
and <fi' : _1 (C/) —> U is the restriction of 4> to </> _1 ({/), then <f>' is also a finite etale 
covering, and one has 

gv = r)(g)v 

for all g G ^i{U,p) and all v G ^({p}) = (0') _1 (M); see [29l §V.7]. 

Lemma 47 (Insensitivity to high codimension sets). Let W be a smooth irre- 
ducible variety, let S be a closed subvariety of W , and let p G W\S , and let 
r\ : ni(W\S,p) — ¥ m(W,p) be the homomorphism described above. 

• If S has codimension at least 1 in W , then r\ is surjective. 

• If S has codimension at least 2 in W , then r\ is an isomorphism. 

Proof. See [291 Corollary V.5.6] for the first part, and 29, Corollary X.3.3] for the 
second part. (This second part can also be deduced from the Zariski-Nagata purity 
theorem [30l Theorem X.3.4].) □ 
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If W is a smooth irreducible variety over k, and p is a point in W, then (because 
k has characteristic zero) one can find an algebraically closed subfield k' of finite 
transcendence degree over Q over which W and p are still defined (by taking the 
algebraic closure of the coefficients of the polynomials that cut out W, as well as 
the coefficients of p) . As such, there exists an embedding r : k' — > C of the field 
k' to the complex numbers C; note that in general that this embedding will not be 
unique. Given such an embedding r, we can then define a complex variety t(W) 
by applying t to all the coefficients of the polynomials defining W. Similarly, by 
applying t to p we have a point r(p) in r(W). As W was smooth, we see that 
t(W) is smooth and irreducible in the algebraic sense, which implies by the inverse 
function theorem that t{W) is smooth and connected in the analytic sense. We can 
then form the ordinary (i.e, topological) fundamental group 7r* op (r(W), r{p)). This 
group is, in general, not profinite; however, we can form the profinite completion 
(t(W),t(p)) a , defined as the Hausdorff completion (i.e. inverse limit) of all 
the finite quotients of 7r* op (r(W / ), r(p)). We have the following deep theorem: 

Theorem 48 (Equivalence of etale and topological fundamental groups). Let W 
be a smooth variety over an algebraically closed field of characteristic zero, let p be 
a point in W , and let k' be a field of finite transcendence degree over Q, such that 
W and p are defined over k! . Let t : k' — » C be an embedding of the field k in C. 
Then there is a canonical identification 



between the Stale fundamental group m{W,p) and the profinite completion of the 
topological fundamental group 7t'° p (t(W), r(p)). Furthermore one has the following 
functorial property: if S is an closed subvariety of W defined over k' that has 
codimension at least 1 and avoids p, then the above identifications intertwine the 
canonical homomorphism 



of etale fundamental groups described previously, and the profinite completion 



to the topological fundamental group tt 1 ° p (t(W),t(p)). (It is easy to see that this 
profinite completion exists and is well defined). 

Proof. The first part of the theorem is [29l Corollary XII. 5. 2], which follows directly 
from the Riemann existence theorem ( |29|, Theorem XII. 5.1]), which provides an 
equivalence of categories between finite analytic covering spaces of t(W) and finite 
etale coverings of W, with ir(W,p) being described completely by its action on fibres 
over p in the latter category, and n° p (t(W) , r(p)) A by its action on fibres over r(p) 
in the former category. The second part of the theorem follows from the obvious 
fact that these equivalences of categories for W and for W\S are intertwined by the 
restriction maps from W to W\S. (More generally, one can replace the inclusion 
map from W\S to W by other smooth maps, but we will not need to do so here.) □ 

Remark 49. Among other things, this shows that up to isomorphism, the profinite 
completion of the topological fundamental group 7r' op (r(H / ), r(p)) A is independent 
of r. However, the topological fundamental group itself can be sensitive to the 
choice of r; see [55] . 



nl° p (T(W),T(p)) 



A 



7ri(W\5 s p)->7ri(W s p) 
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We will use the above equivalence to establish the following result which is crucial 
for our analysis: 

Theorem 50 (Weak etale van Kampen theorem). Let W be a smooth variety 
defined over an algebraically closed field of characteristic zero, and let Wi,W 2 
be closed subvarieties of W of strictly smaller dimension. Let p be a point in 
W\(Wi U W 2 )- By Lemma \J7\ we have canonical surjective homomorphisms from 
m(W\(WiUW2),p) to tt\{W\Wi) and iti(W C\Wi) , as well as canonical surjective 
homomorphisms from it i(W\Wi,p) and ~Ki{W\W2,p) to7Vx(W). 

Then iri(W\(Wi U W2),p) surjects onto the fibre product iri(W\Wi,p) ®-ni{Wp) 
7ri(W\W 2 ,p). 

Proof. By Lemma 23 we may remove W\ PI W2 from W without affecting any of 
the fundamental groups, so we may assume that W± and W2 are disjoint. 

Let k' be an algebraically closed field of finite transcendence degree over Q, over 
which W, Wi, W2,p are all defined, and let r : k! — > C be an embedding of fields. By 
Theorem l48l it suffices to show that G^ 2 surjects onto the fibre product G^ xg a GQ, 
where G, Gi, G2, Gyi are the topological fundamental groups 

G:=7Ti(t(W),t(p)) 

G a := tti(t(W)\t(Wi) 1 t(p)) 

G 2 :=tti(t(W)\t(W 2 ),t(p)) 

Gi 2 := 7ti(t(W)\(t(Wi) U r(W 2 ), r(p)). 

On the other hand, by the topological van Kampen theorem (see e.g. [37l §1.2]), 
G12 can be canonically identified with the amalgamated free product of G\ and G2 
over G; in particular, G12 surjects onto G\Xq G2, which implies that the image of 
G^2 in Gf XgA G2 is dense in the relative product of profinite topologies. On the 
other hand, as profinite groups are compact Hausdorff with respect to the profinite 
topology, this image must also be compact, and the surjectivity follows. □ 

Remark 51. Actually, the above theorem is valid in arbitrary characteristic, and 
furthermore tti{W\{Wi U W 2 ),p) is the coproduct (in the category of profinite 
groups) of tt 1 (W\Wi,p) and tti(W\W2,p) over iri(W,p); see [7T]. This result can 
also be deduced from [29j Theorem IX. 5.1]; ultimately, it is equivalent to the fact 
that finite etale covers over and M / \W r 2 which are isomorphic on H / \(Wi U 

W2) can be glued together to form a finite etale cover of W if Wi, W2 are disjoint. 
However, we will not need this stronger version of the etale van Kampen theorem 
here. 
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